CF15 



• o 




Date of Deposit iVIay 24. 2000 



Certificate of iVIailing 

Label Number: . 



Fl B23fi26704US 



I hereby certify under 37 CFR 1.10 that this correspondence is being deposited withjhe United States^s^tal Service as 
"Express Mail Post Office to Addressee" with sufficient postage on the date indt^ate^above^^pdls addressed to: BOX 
PATENT APPLICATION, Assistant Commissioner for Patents, Washington, D.C.^0237 



Printed name of person mailing correspondence 



Signature of person mailing correspondence 



UTILITY PATENT A 


PPLICATION TRANSMITTAL UNDER 37 CFR §1 .53(b) 


Attorney Docket Number 


01997/198007 


Applicant 


H. ROBERT HORVITZ, JUNYING YUAN, SHAI SHAHAM 


Title 


CLONING, SEQUENCING AND CHARACTERIZATION OF TWO 
CELL DEATH GENES AND USES THEREFOR 


PRIORITY INFORMATION: 


This application is a divisional of and claims priority from United States patent application 08/984,178, 
filed December 3, 1997, which is a continuation of patent application 08/287,669, filed August 9, 1994, 
which is a divisional of patent application 07/979,638, filed November 20, 1992, which is a continuation- 
in-part of 07/897,788, filed June 12, 1992. 


APPLICATION ELEMENTS: 


Cover sheet 


1 page 


Specification 


66 pages 


Claims 


1 5 pages 


Abstract 


1 page 


Drawing 


21 sheets 


Combined Declaration and PGA, which is: 

□ Unsigned; 

□ Newly signed for this application; 

^ A copy from prior application 08/984,178 and the entire 
disclosure of the prior application is considered as being part 
of the disclosure of this new application and is hereby 
incorporated by reference therein. 


3 pages 


Copy of Change of Address from parent application 
08/984.178 


1 page 


, Statement Deleting Inventors 


[**] pages 


Sequence Statement 


[**] pages 


" Sequence Listing on Paper 


[**] pages 


Sequence Listing on Diskette 


n disk 



\\Ntsen/er\documents\01997\01 997.1 98007 Divisional Application Transmittal 37 CFR 1.53(b).wpd 



Small Entity Statement, which is: 

n 1 InQinnpH' 

□ Newly signed for this application; 

□ A copy from pnor application [ bbKiAL rsiuiviDtK j ana 
such small entity status is still proper and desired. 


[**] pages 


Preliminary Amendment 


3 pages 


IDS 


[**] pages 


Form PTO 1449 


[**] pages 


Cited References 


[**] references 


Recordation Form Cover Sheet and Assignment 


[**] pages 


Assignee's Statement 


[**] pages 


English Translation 


[**] pages 


Certified Copy of Priority Document 


[**] pages 


Return Receipt Postcard 


1 


FILING FEES: 


Basic Filing Fee: $690 


$690.00 


Excess Claims Fee: 4 - 20 = 0 x $18 


$0.00 


Excess Independent Claims Fee: 4 - 3 = 1 x $78 


$78.00 


Multiple Dependent Claims Fee: $260/$130 


$0.00 


Total Fees: 


$768.00 


la Enclosed is a check for $768.00 to cover the total fees. 

□ Charge [**AMOUNT**] to Deposit Account No. 03-2095 to cover the total fees. 

□ The filing fee is not being paid at this time. 

H Please apply any other charges, or any credits, to Deposit Account No. 03-2095. 


CORRESPONDENCE ADDRESS: 


Kristina Bieker-Brady, Ph.D. 
Reg. No. 39,109 
Clark & Elbing LLP 
176 Federal Street 
Boston, MA 02110 


Telephone: 617-428-0200 
Facsimile: 617-428-7045 








/ ' Date 



\\Ntserver\documents\01997\01997.198007 Divisional Application Transmittal 37 CFR 1.53(b).wpd 



\\Ntserver\documents\01997\01997.198007 Divisional Application Transmittal 37 CFR 1 .53(b).wpd 



1 



PATENT 

ATTORNEY DOCKET NO. 01997/198007 



Certificate of Mailing 

Date of Deposit: May 24. 2000 Label Number: EL6236267Q4US 



I hereby certify under 37 CFR 1.10 that this correspondence is being deposited with the United States Postal 
Service as "Express Mail Post Office to Addressee" with sufficientm)stage on thp^dSfeih^icated above and is 
addressed to: Assistant Commissioner of Patents, Washington^^ZTC. 2^3 1 . 





Printed name of person mailing correspondence Signature of person mailing correspondence 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Applicant: H. Robert Horvitz et al. Art Unit: 
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Title: CLONING, SEQUENCING AND CHARACTERIZATION OF TWO 

CELL DEATH GENES AND USES THEREFOR 

Assistant Commissioner of Patents 
Washington, D.C. 20231 

PRELIMINARY AMENDMENT 
Kindy amend the application as follows. 
In the specification 

Page 1, line 9, after "is" insert --a divisional of pending USSN 08/984,178, filed 
December 3, 1997, which is a continuation of USSN 08/287,669, filed August 9, 1994, 
now abandoned, which is a divisional of USSN 07/979,638, filed November 20, 1992, 
now abandoned, which is—. 
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In the Claims 

Cancel claims 1-4, 7-15, 17-18, and 20-68. 
Amend claims 5, 6, 16, and 19 as follows. 

5, [Isolated] An isolated protein encoded by [the DNA of Claim 1] a ced-3 nucleic 

acid. 



6. [Isolated] An isolated protein having the amino acid sequence of Figure 4 [Seq. 
ID # 19)] (SEP ID NO: 19) . 

16. [Isolated] An isolated protein encoded by [the DNA of Claim 8] a ced-3 
nucleic acid comprising a mutation . 

19. [Isolated] An isolated protein encoded by [the DNA of Claim 17] an isolated 
nucleic acid comprising: 

(a) a nucleic acid which is structurally related to the ced-S nucleic acid sequence of 
SEP ID NO: 1 8. wherein the polypeptide encoded by said nucleic acid is hydrophilic in 
nature and has a serine rich region: 

(b) a nucleic acid which is functionally related to the ced-3 nucleic acid, wherein 
said functionally related nucleic acid encodes a protein that causes cell death, wherein cell 

-2- 



death is measured by the ability of said nucleic acid to complement ced'3 or ced'4 
mutations in an in vivo or in vitro bioassay: and 

f c^ a nucleic acid which is both structurally and functionally related to the ced-3 
[gene] nucleic acid as described in fa) and (b) . 



REMARKS 

The present Preliminary Amendment cancels claims 1-4, 7-15, 17-18, and 20-68, 
and amends claims 5, 6, 16, and 19. No new matter has been introduced by the present 
amendment. If there are any charges, or any credits, please apply them to Deposit 
Account No. 03-2095. 



Date: 



Respectfully submitted, 




Clark & Elbing LLP 
176 Federal Street 
Boston, MA 02 110 
Telephone: 617-428-0200 
Facsimile: 617-428-70452 

01997.198007 Prelimitiaiy Amendment, wpd 
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Work described herein was supported by grants 
5 GM24663 and GM24943 from the U,S, Public Health 

Service, The U,S, Government has certain rights in the 
invention. 



Related ATDPlication 

This application is a continuation-in-part of USSN 
10 07/897,788; entitled "Cloning, Sequencing and 

Characterization of Two Cell Death Genes and Uses 
Therefor" by H. Robert Horvits, Junying Yuan, and Shai 
Ijl Shaham, filed June 12, 1992. The teachings of USSN 

j^'l 07/897,788 are incorporated by reference. 

15 Background 

Cell death is a fundamental aspect of animal 
O development. Many cells die during the normal develop- 

ment of 'both vertebrates (Glucksmann, Biol. Rev. 
£ Camhridge Philos . Soc. 25:59-86 (1951)) and inverte- 

20 brates (Truman, Ann. Rev. Neurosci. 7:171-188 (1984)). 
These deaths ^appear to function in morphogenesis, 
metamorphosis and tissue homeostasis, as well as in the 
generation of neuronal specificity and sexual 
dimorphism (reviewed by Ellis et al-, Ann. Rev. Cell 
25 Biol. 7:663-698 (1991)). An understanding of the 
mechanisms that cause cells to die and that specify 
which cells are to live and which cells are to die is 
essential for an understanding of animal development. 
The nematode Caenorhahditis elegans is an 
3 0 appropriate organism for analyzing naturally-occurring 
or programmed cell death (Horvitz et al , , Neuroscl. 
Comment. 1:5(^-55 (1982)). The generation of the 959 
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somatic cells of the adult C. elegans hermaphrodite is 
accompanied by the generation and subsequent deaths of 
an additional 131 cells (Sulston and Horvitz, Dev. 
Biol. 82:110-156 (1977); Sulston et al,. Dev. Biol ^ 
5 100:64-119 (1982)) • The morphology of cells undergoing 
programmed cell death in C. eiegrans has been described 
at both the light and electron microscopic levels 
(Sulston and Horvitz, Dev. Biol. 82:100-156 (1977); 
Robertson and Thomson, J* Emhryol , Exp. Morph. 57:8 9- 

10 100 (1982) ) . 

Many genes that affect C. elegans programmed cell 
death have been identified (reviewed by Ellis et ai., 
Ann. Rev. Cell Biol. 7:663-698 (1991)). The activities 
of two of these genes, ced-3 and ced-4, are required 

15 for the onset of almost all C. elegans programmed cell 
deaths (Ellis and Horvitz, Cell 44:817-829 (1986)). 
When the activity of either ced-3 or cBd-'4 is 
eliminated, cells that would normally die instead 
survive and can differentiate into recognizable cell 

2 0 types and even function (Ellis and Horvitz, Cell 

44:817-829 (1986); Avery and Horvitz, Ceil 51:1071-1078 
(1987); White et al., Phil. Trans. R. Soc. Land. B. 
331:263-271 (1991)). Genetic mosaic analyses have 
indicated thit the ced-3 and ced-4 genes most likely 

25 act in a cell autonomous manner within dying cells, 
suggesting that the products of these genes are 
expressed within dying cells and either are cytotoxic 
molecules or control the activities of cytotoxic 
molecules (Yuan and Horvitz, Dev, Biol. 138:3 3-41 

30 (1990)). 

Summary of the Invention 

This invention relates to genes shown to be 
essential for programmed cell death, referred to herein 
as cell death genes, to their encoded products (RNA and 
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polypeptides) , and to antibodies directed against the 
encoded polypeptides. Methods and probes for 
identifying and screening for other cell death genes, 
including those of vertebrates as well as 
5 invertebrates, and possibly, microbes and plants, are 
described • Agents which mimic or affect the activity 
of cell death genes and methods for identifying these 
agents are also described- Bioassays which detect the 
activity of cell death genes and which are useful for 

10 identifying cell death genes, for testing the effect of 
mutations in cell death genes, and for identifying 
agents which mimic or affect the activity of cell death 
genes are also provided. This invention further 
relates to methods for altering (increasing or 

15 decreasing) the activity of the cell death genes or 

their encoded products in cells and, thus, for altering 
the proliferative capacity or longevity of a cell • 
population or organism. 

Specifically, the ced-3 and ced-4 genes of the 

2 0 nematode C, elegrans have been identified, sequenced, 

and characterized. These genes have been shown to be 
required for almost all the prograimed cell deaths 
which occur during development in C. elegrans. Thus, 
two cell death genes and their encoded products (RNA, 
25 polypeptide) are now available for a variety of uses. 

As described herein, the ced-3 and ced-4 genes can 
be used to identify structurally related genes from a 
variety of sources. Some of these related genes are 
likely to also function as cell death genes, 

3 0 Structural comparison of related cell death genes, as 

well as mutational analysis, can provide insights into 
functionally important regions or features of cell 
death genes and gene products. This information is 
useful in the design of agents which mimic or which 
3 5 alter the activity of cell death genes. 
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This invention further provides methods and agents 
for altering (increasing or decreasing) the occurrence 
of cell death in a cell population or organism. 
Methods and agents, described herein, which decrease 
5 cell death are potentially useful for treatment 
(therapeutic and preventive) of disorders and 
conditions characterized by cell deaths, including 
myocardial infarction, stroke, traumatic brain injury, 
degenerative diseases (e.g., Huntington's disease, 

10 amyotrophic lateral sclerosis, Alzheimer's disease, 
Parkinson's disease, and Duchenne's muscular 
dystrophy) , viral and other types of pathogenic 
infection (e.g., human immunodeficiency virus, HIV), 
aging and hair loss. Methods and agents which 

15 increase cell death are also provided and are 

potentially useful for reducing the proliferation or 
size of cell populations, such as cancerous cells,, 
cells infected with viruses (e.g., HIV) or other 
infectious agents, cells which produce autoreactive 

20 antibodies and hair follicle cells. Such methods and 
agents may also be used to incapacitate or kill 
undesired organisms, such as pests, parasites, and 
recombinant organisms , 

Brief Description of the Drawings 

25 Figure 1 shows the genomic organization and 

nucleotide sequence (Seq. ID #1) of ced-4 and deduced 
amino acid sequence (Seq. ID #2) . The genomic sequence 
of the ced-^ region was obtained from plasmid ClODS-5, 
which rescues the cad-4 mutant phenotype. Two likely 

3 0 transcriptional start sites are marked with downward 
arrows . The start of the cDNA is marked with a solid 
arrowhead. The positions of eight ced-4 mutations are 
indicated by upward arrows. Numbers on the sides 
indicate nucleotide positions, beginning at the start 
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of C10D8-5. Numbers under the amino acid sequence 
indicate codon positions. Vertical lines between 
nucleotides indicate splice junctions- 

Figure 2 shows the genomic structure of the ced-4 
5 gene and positions of ced-4 mutations. The sizes of 
exons and introns are indicated in base pairs (bp) . 
The downward arrows indicate the positions of the Tc4 
insertion in the ced-4 (nl416) mutant and of eight EMS- 
induced mutations of ced-'4. The arrow pointing right 
10 indicates the direction of transcription* The solid 
arrowhead indicates the translation initiation site. 
The open arrowhead indicates the ochre termination 
codon. 

Figure 3 shows the sequence similarities between 
15 the Ced-4 protein and some calcium-binding proteins. 
The consensus sequence of the calcium^binding loop is 
shown at the top. The positions indicated by X, Z, 
-X, and -Z correspond to vertices of an octahedron - 
The numbers above the X, Y, Z, -X and -Z correspond to 
20 the positions of the residues within the 29 amino acid 
EF^hand 'sequence, Amino acids are indicated by the 
single letter code, O, amino acid with an oxygen- 
containing side chain, *, non-conserved amino acid, 
Positions Y/'z and -X can be any amino acid with 
25 oxygen-containing side chains. Position X is usually 
aspartic acid, and position -Z is usually glutamic 
acid. Conserved amino acids are shown in bold-face. 
Deviations from the EF-hand consensus sequence are 
underlined, 

3 0 Figure 4 shows the nucleotide sequence (Seq, ID 

#18) of ced-3 and deduced amino acid sequence (Seq. ID 
#19) . The genomic sequence of the ced-3 region was 
obtained from plasmid pJ107, which rescues the ced-3 
mutant phenotype. The likely translation initiation 

35 site is indicated by a solid arrowhead. The SLl splice 



accaptor of the RNA is boxed. The positions of 12 
ced-3 mutations are indicated. Repetitive elements in 
the introns are indicated as arrows above the relevant 
sequence. Nximbers on the sides indicate nucleotide 
positions, beginning with the start of pJ107. Numbers 
under the amino acid sequence indicate codon positions. 

Figure 5A shows the genomic structure of the ced-3 
gene and the location of the mutations. The sizes of 
the introns and exons are given in bp. The downward 
arrows indicate the positions of 12 EMS-induced 
mutations of ced-3. The arrow pointing right indicates 
the direction of transcription. The solid arrowhead 
indicates the translation initiation site. The open 
arrowhead indicates the termination codon. 

Figure 5B shows the locations of the mutations 
relative to the exons (numbered 1-8) and the encoded 
serine-rich region. ; 

Figure 6 is a Kyte-Doolittle hydrophobicity plot 

of the Ced-3 protein. 

Figure 7 shows a comparison of the Ced-3 proteins 
of C. elegans (line 1) and related nematodes, C. 
briggsae (line 2) and C. vulgaris (line 3). The 
conserved amino acids are indicated by Gaps 
inserted in the sequence for the purpose of alignment 
are indicated by 

Figure 8 shows a restriction site map of the ced-4 
region and the relative positions of plasmid C10D8-5, 
plasmid insert pnl416, and three transcripts encoded by 
the region. 

Figure 9 shows physical and genetic maps of the 
ced-3 region on chromosome IV. 

Figure 10 summarizes experiments to localize ced-3 
within C48D1. Restriction sites of plasmid C48D1 and 
subclone plasmids are shown. ced-3 activity was scored 
as the number of cell corpses in the head of LI young 



-7" 

animals. H-h, the niimber of call corpses above 10. +, 
the number of cell corpses below 10 but above 2. 
the number of cell corpses below 2* 

Detailed Description of the Invention 
5 The ced-3 and csd-4 genes of C. elegans have been 

shown to be required for almost all programmed cell 
deaths in C. elegans development (Ellis and Horvitz, 
Cell 44:817-829 (1986)). The present work describes 
the cloning, sequencing and characterization of these 

ID genes. As a result of this work, two genes whose 
activities are required for cell death, referred to 
herein as cell death genes, and their encoded products 
(RNA, polypeptide) are available for a variety of uses. 
Described below are the cloning and characterization of 

15 the C. elegrans ced-4 and ced-3 genes, methods and 
probes for identifying structurally related genes, ^ 
methods for identifying cell death genes from a variety 
of organisms, methods for identifying agents which 
mimic or which affect the activity of cell death genes, 

20 and methods and agents for altering cell death activity 
and thus, for altering the occurrence of cell death in 
a cell population or organism. 

The activity of a cell death gene is intended to 
include the activity of the gene itself and of the 

25 encoded products of the gene. Thus, agents and 

mutations which affect the activity of a gene include 
those which affect the expression as well as the 
function of the encoded RNA and protein. The agents 
may interact with the gene or with the RNA or protein 

3 0 encoded by the gene, or may exert their effect more 
indirectly. 
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The Gene 

The cloning, sequencing and characterization of 
the C. elegans ced-4 gene are described in Example 1. 
Genomic clones were obtained from a ced-4 mutant allele. 
5 generated by transposon tagging. A subclone containing 
as little as 4,4 kb of wild-type genomic DNA was shown 
to complement the ced-4 mutant phenotype (see Table 1; 
tables are located at the end of the Detailed 
Description) • 

10 A 2.2 kb mENA was identified as the ced-4 

transcript. The transcript was shown to be present at 
normal levels in a ced-3 mutant, suggesting that ced-3 
is not a transcriptional regulator of ced-4 gene 
expression • Furthermore, the 2.2 kb transcript was 

15 shown to be expressed primarily during embryogenesis. 
This is consistent with the observation that 113 of the 
131 programmed cell deaths in C. elegans are embryonic 
(Sulston and Horvitz, Dev. Biol. 82:110-156 (1977); 
Sulston et al.. Dev. Biol. 100:64-119 (1983)). 

20 cDNA clones were further obtained and sequenced. 

Analysis of the cDNA and its encoded product indicates 
that the putative Ced-4 protein is 549 amino acids in 
length (Figure 1; Seq. ID #2) and about 62,877 in 
relative molecular mass. The Ced-4 protein is highly 

25 hydrophilic, with a predicted pi of 5.12; there are no 
obvious transmembrane regions. The longest hydrophobic 
region is a segment of 12 amino acids from residues 382 
to 393. 

Sequence analysis of the ced-4 genomic clone and 
3 0 comparison with the cDNA sequence revealed that the 
ced-4 gene contains 7 introns with sizes ranging from 
44 bp to 557 bp (Figure 2). 

The nucleotide sequences of eight EMS-induced 
ced-4 mutations were also determined. Of the eight 
3 5 mutations ; one results in a single amino acid 
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substitution and the other seven appear to prevent 
either ced-4 RNA splicing or completion of Ced-4 
protein synthesis (Figure 2 and Table 2) . These seven 
mutations establish the null phenotype of the ced-4 
5 gene, confirming that ced-4 function is not essential 
for viability. 

Two regions of the inferred Ced-4 protein have 
sequence similarity to known calcium-binding domains 
(Kretsinger, Cold Spring Harbor Symp* Quant. Biol. 

10 52:499-510 (1987)), suggesting that Ced-4 activity and 
hence, programmed cell death may be modulated by 
calcium (see Figure 3 and Example 1) • Calcium has been 
implicated as an essential mediator of cell death in 
other organisms under a variety of conditions. For 

15 example, extracellular calcium is required for 

glucocorticoid-induced thymocyte death (Cohen and Duke, 
J. Imjnunol. 132:38-42 (1984)), for the deaths of adult 
rat hepatocytes induced by certain toxins in vitro ' 
(Schanne et al . , Science 205:700-702 (1979)), for 

20 agonist-induced muscle degeneration in mice (Leonard 
and Salpeter, J. Cell Biol. 82:811-819 (1979)) and for 
neuronal cell death caused by oxygen deprivation or 
excitotoxicity (Coyle et ai., Neurosci . Res. Prog. 
Bull. 19:331-427 (1981); Choi, J. Neurosci. 7:369-379 

25 (1987), Choi, Trends Neurosci. 11:465-469 (1988)). It 
is possible that programmed cell death is initiated 
during C. elegans development by an increase in 
intracellular calcium, which activates the Ced-4 
protein to become cytotoxic. On the other hand, 

3 0 certain cells seem to be protected against cell death 
by calcium (e.g., Koike et al . , Proc. Natl. Acad. Sci. 
USA 85:6421-6425 (1989); Collins et al . , J. Neurosci. 
II : 255'2'-2587 (1991)), suggesting that increases in 
intracellular calcium levels may inhibit the activity 
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of the Ced-4 protein and theraby prevent prograiruned 
cell death. 

The level of the ced-4 transcript in eggs is about 
20% that of the actin 1 transcript, which is relatively. 
5 abundant (Edwards and Wood, Dev. Biol. 97:375--390 
{1983))- This level seems higher than might be 
expected if ced-4 were expressed only in dying cells, 
since in an embryo there are usually no more than two 
or three cells dying at the same time. These 

10 considerations suggest that ced-4 might be transcribed 
not only in dying cells but in other cells as well. 
Perhaps Ced-4 activity, at least during embryonic 
development, is regulated at a post-transcriptional 
level. For example, the Ced-4 protein might have to 

15 interact with other proteins or other factors (such as 
calcium) to cause cell death. Since the ced-3 gene is 
also essential for programmed cell death in C. elagans ^ 
one possibility is that the activity of the Ced-4 
protein is dependent upon ced-3 function. 

20 

The ced-3 Gene 

The cloning, sequencing and characterization of 
the ced-3 gene are described in Example 2. The cecf-3 
gene was cloned by mapping DNA restriction fragment 

2 5 length polymorphisms (RFLPs) and chromosome walking. A 

7.5 kb fragment of genomic DNA was shown to complement 
ced-3 mutant phenotypes. A 2.8 kb transcript was 
further identified. The ced-S transcript was found to 
be most abundant in embryos, but was also detected in 

3 0 larvae and young adults, suggesting that ced-3 is not 

only expressed in cells undergoing programmed cell 
deatH. 

A 2.5 kb cDNA corresponding to the ced-3 mRNA was 
sequenced. The genomic sequence was also determined 
35 (Figure 4; Seq, ID #18) and a comparison with the cDNA 
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sequence revealed that the ced-3 gene has 8 introns 
which range in size from 54 to 1195 bp (Figure 5A) • 
The four largest introns as well as sequences 5' of the 
start codon contain repetitive elements, some of which - 
5 have been previously characterized in non-coding 

regions of other C. elegans genes such as fem-1 (Spence 
et ai.. Cell 60:981-990 (1990)}, lin'12 (J. Yochem, 
personnal communication) , and myoD (Krause et al . , Cell 
53:907-919 (1990) ) • The transcriptional start site was 

10 also mapped, and the ced-3 transcript was found to be 
trans-spliced to a C. elegans splice leader, SLl, 

Twelve EMS-induced ced-3 alleles were also 
sequenced. Eight of the mutations are missense 
mutations, two are nonsense mutations, and two are 

15 putative splicing mutations (Table 3) . The molecular 
nature of these mutations, together with results of 
genetic and developmental analyses of nematodes 
homozygous for these mutations, indicate that, like 
ced-4, ced-3 function is not essential to viability. 

20 In addition, 10 out of the 12 mutations are clustered 
in the C-terminal region of the gene (Figure 5B) , 
suggesting that this portion of the encoded protein may 
be important for activity. 

The ced-3 gene encodes a putative protein of 503 

25 amino acids (Figure 4; Seq. ID #19). The protein is 
very hydrophilic and no significantly hydrophobic 
region can be found that might be a transmembrane 
domain (Figure 6) . One region of the ced-3 protein is 
very rich in serine. Sequence comparison of two 

3 0 additional ced-3 genes from related nematodes, C. 
hriggsae and C. vulgaris , suggests that the exact 
sequence in this serine-rich region may not be 
important but that the serine-rich feature is (Figure 
7; Seq. ID #19-21). This hypothesis is supported by 

35 the analysis of ced-3 mutations: none of 12 EMS-induced 
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ced-3 mutations is in the serine-rich region (Figure 
5B) . 

The conservation of the serine-rich feature among 
the ced-3 genes of different nematodes suggests that 
5 the serine-rich region may act in semi-specific 

protein-protein interactions, similar to acid blobs in 
transcription factors and basic residues in nuclear 
localization signals. In all these cases, the exact 
primary sequence is not important. 

10 It is possible that the serine residues in the 

Ced-3 and Ced-4 proteins may be targets for a Ser/Thr 
kinase, and that the activity of these proteins may be 
regulated post-translationally by protein 
phosphorylation. McConkey et al . (J. Immunol*, 

15 145:1227-1230 (1990)) have shown that phorbol esters, 
which stimulate protein kinase C, can block the death 
of cultured thymocytes induced by exposure to Ca"^*^ , 
ionophores or glucocorticoids (Wyllie, Nature 284:555- 
556 (1980); Wyllie et al., J* Path. 142:67-77 (1934)). 

20 It is possible that protein kinase C may inactivate 
certain cell death proteins by phosphorylation, and 
thus, inhibit cell death and promote cell 
proliferation. Several agents that can elevate 
cytosolic cASP levels have been shown to induce 

25 thymocyte death, suggesting that protein kinase A may 
also play a role in mediating thymocyte death. Further 
evidence suggests that abnormal phosphorylation may 
play a role in the pathogenesis of certain cell- 
degenerative diseases. For example, abnormal 

3 0 phosphorylation of the microtubule-associated protein 
Tau is found in the brains of Alzheimer's disease and 
Down-'^B syndrome patients (Grundke-Iqbal et al,, Proc . 
Natl. Acad. Sci. USA 83:4913-4917 (1986); Flament et 
al.. Brain Res. 515:15-19 (1990)). Thus, it is 

3 5 possible that phosphorylation may have a role in 
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regulating programmed cell death in C. elegans. This 
is consistent with the fairly high levels of ced-3 and 
ced-4 transcripts which suggest that transcription 
regulation alone may be insufficient to regulate 
5 programmed cell death. 

Structurally and Functionally Related Genes 

As a result of the work described herein, it is 
possible to identify genes which are structurally 

10 and/or functionally related to ced-3 or ced-4 . Such 
genes are expected to be found in a variety of 
organisms, including vertebrates (e.g., mammals and 
particularly humans) , invertebrates (e.g. , insects) , 
microbes (e.g., yeast) and possibly plants. 

15 structurally related genes refer herein to genes which 
have some structural similarity to the nucleotide 
sequences (genomic or cDNA) of one or both of the ced-3 
or ced-4 genes, or whose encoded proteins have some 
similarity to one or both of the amino acid sequences 

20 of the Ced-3 or Ced-4 proteins. Functionally related 
genes refer to genes which have similar activity to 
that of ced-3 and ced-4 in that they cause cell death. 
Such genes can be identified by their ability to 
complement ced-3 or ced-4 mutations in bioassays, as 

25 described below. 

Previous studies are consistent with the 
hypothesis that genes similar to the C. elegajis ced-3 
and ced-4 genes may be involved in the cell deaths that 
occur in both vertebrates and invertebrates. Some 

30 vertebrate cell deaths share certain characteristics 

with the programmed cell deaths in C. eleg^ns that are 
controlled by ced-3 and ced-4. For example, up to 14% 
of the neurons in the chick dorsal root ganglia die 
immediately after their births, before any signs of 

35 differentiation (Carr and Simpson, Dev. Brain Res. 
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2:57-162 (1982)), Genes like ced-3 and ced-4 could 
well function in this class of vertebrate cell death. 
In addition, genes related to ced-3 and ced-4 could 
function in many other types of vertebrate cell death 
5 processes, including those involving cells that die 
long after their births and those that die as a result 
of stress {e.g., oxygen deprivation) or disease. 

Genetic mosaic analysis has suggested that the 
ced-3 and ced-4 genes act within cells that undergo 

10 programmed cell death, rather than through cell-cell 
interactions or diffusible factors (Yuan and Horvitz, 
Dev. Biol. 23S: 33-41 (1990)). Many cell deaths in 
vertebrates seem different in that they appear to be 
controlled by interactions with target tissues. For 

15 example, it is thought that a deprivation of target- 
derived growth factors is responsible for vertebrate 
neuronal cell deaths (Hamburger and Oppenheim, 
Neurosci. Comment. 1:39-55 (1982)); Thoenen et ai , , in: 
Selective Neuronal Death, Wiley, New York, 1987, Vol. 

20 126, pp. 82-85). However, even this class of cell 

death could involve genes like ced-3 and ced-4, since 
pathways of cell death involving similar genes and 
mechanisms might be triggered in a variety of ways. 
Supporting this idea are several in vitro and in vivo 

25 studies which show that the deaths of vertebrate as 
well as invertebrate cells can be prevented by 
inhibitors of RNA and protein synthesis, suggesting 
that activation of genes is required for these cell 
deaths (Martin et ai . , J. Cell Biol. 105:829-844 

30 (1988); Cohen and Duke, J, Immunol. 132:38-42 (1984); 
Oppenheim and Prevette, Neurosci. Abstr. 14:368 (1988); 
Stani^ic et al . , Invest. Urol. 15:19-22 (1978); 
Oppenheim et al . , Dev. Biol. 133:104-113 (1990); 
Fahrbach and Truman, in: Selective Neuronal Deaths Ciba 

35 Foundation Symposium , 1987, No. 125, pp* 65-81). It is 
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possible that the genes induced in these dying 
vertebrate and invertebrate cells are cell death genes 
similar to the C. elegans genes ced-3 and ced-4 . 

Also supporting the hypothesis that cell death in 
C. elegans is mechanistically similar to cell death in 
vertebrates is the observation that the protein product 
of the C. elegans gene ced-9 is similar in sequence to 
the human protein Bcl-2. ced-9 has been shown to 
prevent cells from undergoing programmed cell death 
during nematode development by antagonizing the 
activities of ced-3 and ced-4 (Hengartner, et al . , 
Nature 355:494-499 (1992)). The bcl-2 gene has also 
been implicated in protecting cells against cell death. 
It seems likely that the genes and proteins with which 
ced-9 and bcl-2 interact are similar as well. 

Genes which are structurally related to ced-3 or 
ced-4 are likely to also act as cell death genes. ■ 
Structurally, related genes can be identified by any 
number of detection methods which utilize a defined 
nucleotide or amino acid sequence or antibodies as 
probes. .For example, nucleic acid (DNA or RNA) con- 
taining all or part of the ced-3 or ced-4 gene can be 
used as hybridization probes or as polymerase chain 
reaction (PGR) primers. Degenerate oligonucleotides 
derived from the amino acid sequence of the Ced-3 or 
Ced-4 proteins can also be used. Nucleic acid probes 
can also be based on the consensus sequences of 
conserved regions of genes or their protein products. 
In addition, antibodies, both polyclonal and 
monoclonal, can be raised against the Ced-3 and/or Ced- 
4 proteins and used as immunoprobes to screen 
exTJression libraries of genes. 

One strategy for detecting structurally related 
genes in other organisms is to initially probe animals 
which are taxonomically closely related to the source 
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of the probes, for example, probing other worms with a 
ced-3 or ced-4 probe. Closely related species are more 
likely to possess related genes or gene products which 
are detected with the probe than more distantly related- 
5 organisms. Sequences conserved between ced-3 or ced-4 
and these new genes can then be used to identify 
similar genes from less closely related species. 
Furthermore, these new genes provide additional 
sequences with which to probe the molecules of other 

10 animals, some of which may share conserved regions with 
the new genes or gene products but not with ced-3 , 
ced-4, or their gene products. This strategy of using 
structurally related genes in taxonomically closer 
organisms as stepping stones to genes in more distantly 

15 related organisms can be referred to as walking along 
the taxonomic tree. 

Groups of structurally related genes, such asi 
those obtained by using the above-described strategy, 
can be referred to as gene families. Comparison of 

2 0 members within a gene family, or their encoded 

products J. . may indicate functionally important features 
of the genes or their gene products. Those features 
which are conserved are likely to be significant for 
activity. Such conserved sequences can then be used 

25 both to identify new members of the gene family and in 
drug design and screening. For example, as described 
in Example 2, genes similar to ced-3 from two other 
species of nematodes (C. briggsae and C. vulgaris) were 
identified and characterized. Serine-rich regions were 

30 found in the polypeptides encoded by all three genes. 
Although the sequence of the serine-rich region was not 
well qppserved, the number of serines was conserved, 
suggesting that the serine-rich feature, but not the 
exact sequence of the serine-rich region, is 

35 significant for function. 
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Functionally important regions can also be 
identified by mutagenesis. For example, inactivating 
mutations of ced-3 were found to cluster within a 
region near the COOH-terminus (Figure 5B) , suggesting 
5 that this region is a functionally important domain of 
the Ced-3 protein. Further mutational analyses can be 
carried out on the ced-3 and CBd-4 genes; mutants with 
novel properties, as well as other regions important 
for activity, may be discovered. Mutations and other 
10 alterations can be accomplished using known methods, 
such as in vivo and in vitro mutagenesis (see, e.g., 
Ausubel at al . (eds.). Current Protocols in Molecular 
Biology, Greene Publishing Associates and Wiley- 
Interscience, New York) . 

15 Bioassavs and Agents Which Affect the Activity of Cell 
Death Genes 

This invention further provides bioassays which 
detect the activity of cell death genes. The bioassays 
can be used to identify novel cell death genes, to 

20 identify mutations which affect the activity of cell 
death genes, to identify genes which are functionally 
related to known cell death genes, such as ced-3 or 
ced~4 , to identify genes which interact with cell death 
genes, and to identify agents which mimic or affect the 

25 activity of cell death genes (e.g., agonists and 

antagonists) . For example, the bioassays can be used 
to screen expression gene libraries for cell death 
genes from other organisms. 

In this bioassay, genes or agents are introduced 

3 0 into nematodes to test their effect on cell deaths in 
vivo. Wild-type, mutant, and transgenic nematodes can 
be used as appropriate for the effect being tested. In 
one embodiment of this bioassay, transgenic nematodes 
are produced using a candidate cell death gene, a 
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mutant cell death gene, or genes from an expression 
library, to observe the effect of the transgene on the 
pattern of programmed cell deaths during development of 
the nematode. For example, a gene which is 
5 structurally related to ced-3 can be used to produce a 
transgenic animal from a mutant nematode which 
underexpresses or expresses an inactivated ced-3 gene 
to see if the related gene can complement the ced-3 
mutation and is thus, functionally as well as 
10 structurally related to ced-3. cDNA or genomic 

libraries can be screened for genes having cell death 
activity. Genes which interact with cell death genes 
to enhance or suppress their activity can also be 
identified by this method. 
15 In another embodiment of the bioassay, wild-type, 

mutant, or transgenic nematodes are exposed to or 
administered peptides and other molecules in order ^to 
identify agents that mimic, increase, or decrease the 
activity of a cell death gene. For example, wild-type 
20 animals can be used to test agents that inactivate or 
antagonize the activity of ced-3 or ced-4 and hence, 
decrease cell deaths, or that activate or enhance ced-3 
or ced-4 activity and increase cell deaths. Mutant 
animals in which ced-3 or ced-4 is inactivated can be 
25 used to identify agents or genes which mimic ced-3 or 
ced-4 in causing cell deaths. Mutant animals in which 
ced-3 or ced-4 is overexpressed or constitutively 
activated can similarly be used to identify agents that 
prevent ced-3 or ced-4 from causing cell death, 
30 Transgenic animals in which a wild-type or mutant form 
of an exogenous cell death gene causes excess cell 
deathsi due to overexpression or hyperactivity can be 
used to identify agents that inactivate or inhibit the 
activity of the transgene. Similarly, transgenic 
3 5 animals in which a wild-type or mutant form of an 



-19- 

exogenous cell death gene is underexpressed or inactive 
can be used to identify agents that activate or 
increase its activity. Test molecules can be 
introduced into nematodes by microinjection, diffusion,' 
5 ingestion, shooting with a particle gun, or other 
method . 

Mutated cell death genes with novel properties may 
be identified by the above bioassay. For example, 
constitutively activated or hyperactive cell death 

10 genes may be isolated which may be useful as agents to 
increase cell deaths. Mutations may also produce genes 
which do not cause cell death but which antagonize the 
activity of the wild-type gene. 

Agents can be obtained from traditional sources, 

15 such as extracts (e.g., bacterial, fungal or plant) and 
compound libraries, or by newer methods of rationale 
drug design. Information on functionally important 
regions of the genes or gene products, gained by 
sequence and/or mutational analysis, as described 

20 above, may provide a basis for drug design. The 

activity. of the agents can be verified both by in vivo 
bioassays using nematodes which express various forms 
of ced-3, ced~4 , or related genes, as described above, 
and by in vitro systems, in which the genes are 

25 expressed in cultured cells, or in which isolated or 
synthetic gene products are tested directly in 
biochemical experiments. The agents may include all or 
portions of the ced-4, or related genes, mutated 

genes, and all or portions of the gene products (RNA, 

30 including antisense RNA, and protein) , as well as 
nucleic acid or protein derivatives, such as 
oligonucelotides and peptides, peptide and non-peptide 
mimetics, and agonists and antagonists which affect the 
activity or expression of the cell death genes. The 

35 agents can also be portions or derivatives of genes or 
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gene products which are not cell death genes but which 
regulate the expression of, interact with, or otherwise 
affect the function of cell death genes or gene 
products . 

5 Uses of the Invention 

Using the above-described probes and bioassays, 
the identification and expression of ced-J , cad-4 or 
related cell death genes in cultured cells, tissues, 
and whole organisms can be studied to gain insights 

10 into their role in development and pathology in various 
organisms. For example, the detection of abnormalities 
in the sequence, expression, or activity of a cell 
death gene or gene product may provide a useful 
diagnostic for diseases involving cell deaths, 

15 This invention further provides means of altering 

or controlling the activity of a cell death gene in a 
cell, and, thus, affecting the occurrence of cell 
death. Activity of the cell death gene can be altered 
to either increase or decrease cell deaths in a 

2 0 population of cells and, thus, affect the proliferative 
capacity or longevity of a cell population, organ, or 
entire organism. 

Agents which act as inactivators or antagonists of 
the activity of ced-3, ced-4, or other cell death genes 

25 can be used to prevent or decrease cell deaths. Such 
agents are useful for treating (i.e., for both 
preventive and therapeutic purposes) disorders and 
conditions characterized by cell deaths, including 
neural and muscular degenerative diseases, stroke, 

30 traumatic brain injury, myocardial infarction, viral 
(e.g.V'^HIV) and other types of pathogenic infections, 
as well as cell death associated with normal aging and 
hair loss. The agent can be delivered to the affected 
cells by various methods appropriate for the cells or 
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organs being treated, including gene therapy. For 
example, anti-sense RNA encoded by all or a part of a 
cell death gene which is complementary to the mRNA can 
be delivered to a population of cells by an appropriate- 

5 vector, such as a retroviral or adenoviral vector, or 
an antagonist of cell death activity can be infused 
into a wound area to limit tissue damage- 

Methods and agents which cause or increase cell 
deaths are also useful, for example, for treating 

10 disorders characterized by an abnormally low rate or 

number of cell deaths or by excessive cell growth, such 
as neoplastic and other cancerous growth. Such methods 
and agents are also useful for controlling or 
eliminating cell populations, such as cells infected 

15 with viruses (e.g., HIV) or other infectious agents, 
cells producing autoreactive antibodies, and hair 
follicle cells. In addition, methods and agents which 
increase cell death can be used to kill or incapacitate 
undesired organisms, such as pests, parasites and 

20 genetically engineered organisms. All or portions of 
ced-3, ce.d-4, or related cell death genes, active 
mutant genes, their encoded products, agents which 
mimic the activity of cell death genes, and activators 
and agonists' of cell death genes can be used for this 

25 purpose. 

For example, cell death genes can be used to kill 
cells infected with the human immunodeficiency virus 
(HIV), and thus, prevent or limit HIV infection in an 
individual. A recombinant gene can be constructed, in 
3 0 which a cell death gene is under the control of a viral 
promoter which is specifically activated by a viral 
protein; the recombinant gene is introduced into HIV 
infected cells, HIV-infected cells containing the 
viral activator protein would express the cell death 



gene product and be killed, and uninfected cells would 
be unaffected. 

Alternatively, an antagonist of ced-3 or ced-4 
activity (such as antisense RNA) can be expressed under- 
5 the control of a viral-specific promoter and in this 
way, be used to prevent the cell death associated with 
viral (e.g., HIV) infection. 

In another example, cell death genes can be used 
as suicide genes for biological containment purposes. 

10 Genetic engineering of suicide genes into recombinant 
organisms has been reported in bacteria (Genetic 
Engineering News ^ Nov* 1991, p. 13) : suicide genes 
were engineered to be expressed simultaneously with the 
desired recombinant gene product so that the 

15 recombinant bacteria die upon completion of their task. 
The present invention provides suicide genes which are 
useful in a variety of organisms in addition to , 
bacteria, for example in insects, fungi, and transgenic 
rodents. Suicide genes can be constructed by placing 

2 0 the coding sequence of an exogenous cell death gene or 

an agonist of an endogenous cell death gene of the 
organism in an expression vector suitable for the 
organism. 

In addition, agents which increase cell death are 
25 useful as pesticides (e.g., anthelminthics, 

nematicides) . For example, many nematodes are human, 
animal, or plant parasites. ced-3, ced-4, or other 
nematode cell death genes, their gene products, 
mimetics, and agonists can be used to reduce the 

3 0 nematode population in an area, as well as to treat 

individuals already infected with the parasite or 
protect individuals from infection. A transgenic plant 
or animal carrying a constitutively activated ced-3 
gene, ced-4 gene, or other cell death gene specific to 



nematodes can be protected from nematode infection in 
this way. 

The subject invention will now be illustrated by 
the following examples, which are not intended to be 
5 limiting in any way. 2 

EXAMPLE 1 

CLONING, SEQUENCING AND CHARACTET^IZATION OF 
THE CED-4 GENE 

MATERIALS AND METHODS 

10 General Methods and Strains 

Techniques used for the culturing of C. elegans 
were essentially as described by Brenner (Genetics 
77:71-94 (1974)). All strains were grown at 20°C. DNA 
was prepared from worms grown on Petri dishes 

15 containing agarose seeded with E. coli strain H3101. 
RNA was prepared from mass cultures grown in liquid. 
Usually, the bacterial pellet from a 2 L overnight 
culture of E. coli HBlOl grown in superbroth {12 g 
Bacto-tryptone, 24 g yeast extract, 8 ml 50% glycerol, 

20 900 ml H2O; after autoclaving, 100 ml 0.17 M KH2HPO4 

and 0,72 K2HPb4 were added) was resuspended in 500 ml S 
basal medium (Brenner, 1974 supra) , and worrms were 
added from one or two 10 cm Petri dishes in which the 
bacterial lawns had just been consumed. Worms were 

25 harvested about 4-5 days later by centrifugation and 
washed in M9 buffer (Brenner, 1974 supra) . The yield 
was about 5-10 ml of packed worms. 

Nomarski differential interference contrast 
microscopy was used to examine individual cells in 

3 0 living'nematodes (Sulston and Horvitz, Dev. Biol. 
82:110-156 (1977)). Methods for scoring the Ced 
phenotype of ced-1 , ced~4 and ced-I ; ced-4 double 
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lautants have been described by Ellis and Horvitz, {Cell 
44:817-829 (1986)) and by Yuan and Horvitz, {Dev. Biol. 
138:33-41 (1990)). 

The wild-type parent of all mutant strains used in- 
5 these experiments was C. elegans variety Bristol strain 
N2 (Brenner, 1974 supra) . The genetic markers used are 
listed below. These markers have been described 
(Brenner, 1974 supra; Hodgkin et al . , in: The Nematode 
Caenorhabditis elegans, Wood and the Community of C* 
10 elegans Researchers (eds.), Cold Spring Harbor 

Laboratory, New York, 1988, pp, 491-584; Finney et ai . , 
Cell 55:757-769 (1933)). The strain TR679 carries the 
mutator mut-2 (r459) (Collins et ai . , Nature 328:726-728 
(1987)). The ced-4 alleles nl894, nl920, nl947, nl948, 
15 n2247, and n2273 were characterized in the present 

work. Genetic nomenclature follows the standard system 
for C. elegans (Horvitz et al., Mol . Gen. Genet. 
175:129-133 (1979)): 

LG I: ced-l(el735) J. unc-54(r323) 

20 LG III: unc-86 (nl351) ^ ced-4 (nll62, nl416, 

nl894, nl920, nl947, nl948, n2247, 
n2273, nl416 nl712, nl416 nl713), 
unc-79 (el068) , dpy-17 (el64) 
LG IV: ' unc-31(e928) , ced-3 (n717) 
25 LG V: egl-l(n986), unc-76(e911) 

Genomic Libraries 

A 4-6 kb size-selected phage library was 
constructed from ced-4 (nl416) DNA as follows. Genomic 
DNA was digested with Hindlll and run on a low-melting 
3 0 agarose gel. DNA migrating within the 4-6 kb size 
range was excised, and the low-melting agarose was 
removed by phenol extraction and precipitation 
(Maniatis et ai . , Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory (1983)), These 
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DNA fragments were ligated to ffiiidlll-digested DNA from 
phage ).NM1149 (Murray, Phage Lambda and Molecular 
Cloning, Cold Spring Harbor Laboratory, 1983, pp. 395- 
432) • The product DNA was packaged with packaging 
5 extract from Promega, This library had a total of 
140,000 plaque-forming units (pfu) , of which 70% were 
recombinants, as estimated from the ratio of pfu on 
bacteria CSOOhfl and C600, 

The phage genomic library (provided by J. Sulston) 
10 was prepared by partial digestion of wild- type C. 
elegans genomic DNA with Sau3A and cloning into the 
BamEl site of phage vector A.2001 (Karn et al . ^ Gene 
32:217-224 (1984) ) . 

Tc4 Probe 

15 The Tc4 probe used for cloning the ced-4 gene and 

for Southern blots was Tc4-J3l35I, which contains a ,Tc4 
element isolated from an unc-86 {nl351) mutant strain 
(Finney et al.. Cell 55:757-769 (1988); Yuan et al., 
Proc. Natl. Acad. Sci. USA 88:3334-3338 (1991)). DNA 

2 0 was labelled with ^^P using either the nick-translation 
procedure described by Maniatis et al . (1983 supra) or 
the oligo-labelling procedure described by Feinberg and 
Vogelstein {Anal. Biochem, 132:6-13 (1983)). 

RNA Preparation, Northern Blot and Primer Extension 
25 Total C. elegans RNA was extracted using guanidine 

isothiocyanate (Kim and Horvitz, Genes & Dev. 4:357-371 
(1990)). Poly(A)"^ RNA was selected from total RNA by a 
poly (dT) -column (Maniatis et al . , 1983 supra). To 
prepare stage-synchronized animals, eggs were obtained 
30 from gravid C. elegans adults grown at 20°C in liquid 
culture. A 5 - 10 ml sample of animals was treated 
with 50 ml of NaOCl/NaOH solution (10 ml NaOCl, 1 g 
NaOH, 40 ml H2O) for about 10 minutes with vortexing 



-26- 

until the adults were dissolved. Eggs were centrifuged 
and washed three times with M9 buffer- Isolated eggs 
were allowed to hatch in S basal medium without food 
for 14 hours at 20**C with shaking, LI larvae were 
5 collected by low-speed centrifugation after growth on 
E. coli HBlOl for 2 hours, L2 larvae after 12 hours, L3 
larvae after 24 hours, L4 larvae after 3 6 hours and 
adults after 48 hours. Northern blot analysis using 
DNA probes was performed essentially as described by 
10 Meyer and Casson {Genetics 105:29-44 (1986)), except 
that RNA was transferred from the gel to the Gene 
Screen filter (DuPont, Wilmington, DE) by capillary 
action. 

Quantitation of ced-4 expression during embryonic 

15 development was done by hybridizing two duplicate 

northern blots with ced-4 cDNA clone SK2-2 and with a 
genomic DNA clone for the actin 1 gene, pW-16-210, , 
which hybridizes to the 3' untranslated region of the 
actin 1 transcript (Krause and Hirsh, in: Molecular 

20 Biology of the Cytoskeleton , Borisy at ai , (eds.), Cold 
Spring Harbor Laboratory, 1984, pp. 287-292). The two 
probes were of the same specific activity (4 x 10^ 
counts/minute/jLig) . The emission of fl particles from 
the ced-4 and actin 1 bands was counted using a B 

25 counter (Betagen, Waltham, MA). The readings were 7.7 
counts /minute for the actin 1 band and 1.4 counts/ 
minute for the ced-4 band. 

The primer extension protocol was that of Sambrook 
at al . (Molecular Cloning: A Laboratory Manual, 2nd 

30 edition, Cold Spring Harbor Laboratory, 1989, pp. 7.79- 
7.83), using the primer ATTGGCGATCCTCTCGA (Seq. ID 
#22) . To define the lengths of the reaction products, 
a sequencing reaction using this primer and C10D8-5 as 
template was run adjacent to the product of the primer 

3 5 extension reaction in the sequencing gel. 
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Direction of Transcription 

The direction of transcription was determined by 
hybridizing northern blots with single-stranded RNA 
probes. The Bluescribe plasmid containing the insert 
5 pnl416 was linearized by digestion with either BamHl or 
ffindlll, which cleaved at one or the other end of the 
insert. The linearized product was transcribed using 
T3 or T7 RNA polymerase, respectively, generating RNA 
from each strand. These RNA products were used to 

10 probe Northern blots according to a protocol developed 
by Z, Liu and V. Ambros: Filters were prehybridized in 
50% formamide, 50 mM sodium phosphate (pH 6.5), 5 x 
SSC, 8 X Denhardt's, 0.5% SDS, 250 ^q/nl salmon sperm 
DNA and then hybridized with probe at 55 *C and washed 

15 in 4 X SSC, 0.1% SDS at 60 °C 3 times for 20 minutes 

each and then in 2 x SSC, 0.1% SDS once at 60°C for 20 
minutes. Northern blot experiments showed that the 
single-stranded RNA probe transcribed by T3 RNA 
polymerase hybridized to the 2.2 kb ced-4 mRNA, while 

20 the probe made by T7 RNA polymerase did not. This 
result indicates that the direction of the 
transcription is from the BamEI site toward the HindlU 
site of pnl416. 

Determination of DNA Sequence 

25 For determining DNA sequences, serial deletions 

were made according to Henikoff (Gene 2S: 351-359 
(1984)). DNA sequences were determined using Sequenase 
and protocols obtained from US Biochemicals (Cleveland, 
OK) . The ced-4 DNA sequence was confirmed by 

3 0 sequencing both strands of cDNA and genomic DNA clones - 

Cloning of the Cosmid Fragment C10D8-5 

The cosmid C10D8 was digested with £'coRI . Two 
EcoRl fragments of 2.2 kb (rS) and 2,4 kb (r7) , both of 
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which hybridized to a mixture of ced-4 cDNA subclones 
SK2-1 and SK2-2, were isolated. r7, which hybridized 
to SK2-1, which contains the 3' half of ced-4 cDNA 
clone SK2, was cloned into the EcoRl site of plasmid 
5 pBSKII (Stratagene) . The EcoRl site at the 3' end of 
r7 was deleted by digesting with Styl, which cut once 
at 0.2 kb from the 3' end of the insert, and Sail, 
which cut once in the polylinker, and then religating. 
The deleted r7 plasmid was linearized with EcoRl and 
10 ligated with -^coRI-digested r5, which hybridized to 
Sk2-2, the 5' half of ced-4 cDNA SK2 . Clones were 
analyzed for the correct orientation of the r5 insert 
based on the cDNA restriction map. One such correctly 
oriented clone was named C10D8-5. 

15 Microinjection and Transformation 

The procedure for microinjecting DNA into the. 
gonad to obtain germline transf ormants was basically 
that of Fire (EMBO J. 5:2673-2680 (1986)) with 
modifications introduced by J. Sulston. Cosmid DNA to 

2 0 be injected was purified twice using CsCl-gradient 

centrifugation (Maniatis et al . , 1983 supra), Plasmid 
DNA to be injected was prepared by alkaline minipreps 
(Maniatis et'al., 1983 supra). DNA was treated with 
RNAase A {37*='C, 30 minutes) and then with proteinase K 

25 (55^*0, 30 minutes), extracted with phenol and then 

chloroform, precipitated twice (first in 0.3 M sodium 
acetate and then in 0.1 M potassium acetate, pH 7.2), 
and resuspended in 5 ul of injection buffer (Fire, 198 6 
supra) . DNA concentrations used for injection were 

30 0.1-1.0 mg/ml. 

All transformation experiments used a ced-1; 
ced-4 (1x1162) ; unc-31 strain as the recipient. The 
expression of the Ced-4 phenotype was quantified by 
counting the number of cell corpses in the heads of 



young LI animals. The cosmid C10D8 or plasmid 
subclones of C10D8 were mixed with cosmid C14G10, which 
contains the wild-type unc~31 (-h) gene, at a ratio of 
2:1 or 3:1 to increase the likelihood that a 
5 phenotypically non-Unc transformant would contain the 
cosmid or plasmid being tested. Generally, 2 0-3 0 
animals were injected in one experiment. Non-Unc Fl 
progeny of injected animals were isolated three to four 
days later. About 1/2 to 1/3 of the non-Unc progeny 
10 transmitted the non-Unc phenotype to their progeny and 
could be established as lines of transf ormants . Young 
LI non-Unc progeny of such non-Unc transf ormants were 
examined using Nomarski optics to determine the number 
of cell corpses present in the heads. 

15 Ced-4 Fusion Protein and Antibody Preparation 

To express a Ced-4 fusion protein in E. coll, ^a 
clone containing both the 5^ and 3' halves of the ced-4 
cDNA (SK2-2 and SK2-1) in the expression vector pET-5a 
(Rosenberg at al.. Gene 55:125-135 (1987)) was 

20 constructed. The fusion protein expressed by this 

vector was expected to include 11 amino acids of phage 
T7 gene 10 protein, 5 amino acids of linker and the 546 
amino acids encoded by ced-4 cDNA SK2 . The pJ7 6 
plasmid, which encodes this fusion protein, was 

25 transformed into bacterial strain BL21* ced-4 fusion 
protein was produced by this transformed strain, as 
expected, and subjected to electrophoresis on a 
polyacrylamide gel. A band, with mobility equivalent 
to about 64 X 10-^ Mr, specific to the transformed 

3 0 strain was exercised and used to immunize three 

rabbits. Sera from all three rabbits tested positive 
on wesrern blots (Towbm et al . , Proc. Natl. Acad. Scz . 
OSA 75:4350-4354 (1979)). These sera were purified 
using immunoblots (Harlow and Lane, Antibodies : A 
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Laboratory Manual, Cold Spring Harbor Laboratory, 
1988) . 

RESULTS 

Cloning of the ced~4 Gene by Transposon Taaaing 
5 The ced-4 allele nl416 in the C. elegans strain 

TR679 was isolated, which carries the mutator mut- 
2(r459) and shows an elevated frequency of 
transposition elements (Collins et al.. Nature 328:726- 
728 (1987); Yuan et al., Proc. Natl. Acad. Sci. USA 

10 88:3334-3338 (1991)). The ced-4 (nl416) mutation is 
closely linked to a newly transposed copy of the C. 
elegans transposon Tc4 (Yuan et al » , 1991 supra). 
Using Tc4 as a probe, this novel Tc4 element and its 
flanking region was cloned as a 5 kb Hindlll fragment 

15 from a 4-6 kb size-selected ced-4 (nl416) genomic phage 
library. A 3 kb adjacent to this Tc4 element was 
isolated by digesting the 5 kb ffindlll fragment with 
BamEl. This 3 kb fragment, called pnl416, was cloned 
into the -Bluescribe M13+ plasmid vector (Stratagene) . 

2 0 When used as a probe on Southern blots, pnl416 

hybridized to^ a 3.4 kb Hindlll fragment in DNA of wild- 
type (strain N2) and two non-Ced revertants of 
ced-4 (nl416) , ced-4 (nl416 nl712) and ced-4 (nl416 nl713) 
(Yuan and Horvitz, Dev. Biol. 138:33-41 (1990)), and a 
25 5 kb ifindlll fragment in ced-4 (nl416) animals. The 
hybridizing band in ced-4 (nl416) DNA is 1.6 kb larger 
than that of the wild-type or the revertants, 
indicating that an insertion of this size is present in 
the ced-4 (nl416) strain and is deleted in both 

3 0 revertants. These observations indicate that the Tc4 

insertion in ced-4 (nl416) animals is responsible for 
their Ced-4 mutant phenotype and suggest that pnl416 
contains at least part of the ced-4 gene. 



-31- 

To isolate additional genomic DNA from the region 
of this Tc4 insertion, pnl416 was used to probe a C. 
elegans Bristol N2 genomic DNA phage library. Five 
phage clones with inserts of 10 to 
5 15 kb were isolated and shown to share a 3 kb BamEl- 
Hindlll fragment that hybridized to pnl415. These 
phage clones were used to identify cosmids that 
hybridized to them and that were members of a 600 kb 
contig of overlapping cosmids (Coulson et al . , Proc. 
10 Natl. Acad. Sci. USA 83:7821-7825 (1986)). By using 
the phage clones as probes to hybridize to Southern 
blots, a cosmid C10D8 was identified as containing all 
regions of genomic DNA present in all five phage clones 
and in pnl416. 

15 The ced-4 Mutant Phenotvpe Can Be Rescued by a 4.4 kb 
DNA Fragment , 

To identify ced-4 (+) DNA capable of complementing 
the Ced-4 mutant phenotype, the cosmid C10D8 was 
injected into the oocytes of ced-4 (nll62} animals. To 

20 facilitate the identification of transgenic animals, a 
mutation in the unc'31 gene, which affects locomotion, 
was included as a marker for co-transformation (Kim and 
Horvitz, Genes & Dev. 4:357-371 (1990)). Cosmid 
C14G10, Which contains the wild-type allele of unc-31 

25 and does not have Ced-4-rescuing activity was 
coinjected with cosmid C10D8 into ced-1 (el735) ; 
unc-31 (e928) ; ced-4 (nll62) animals. The ced-1 mutation 
was included to facilitate the scoring of the ced-4 
mutant phenotype (Ellis and Horvitz, Cell 44:817-829 

30 (1986)). Specifically, when a cell undergoes 

programmed cell death in C. elegans, its corpse is 
quickly engulfed and destroyed by a neighboring cell 
(Robertson and Thomson, J. Embryo 1 . Exp. Morph. 57:89- 
100 (1982); Sulston et al . , Dev. Biol. 200:64-119 
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(1983)). A ced-1 mutation prevents this engulfment, 
allowing the cell corpse to remain intact (Hedgecock et 
al.. Science 220:1277-1280 (1983)). Thus, in a first 
or second stage (LI or L2) ced-l mutant larva, many 
5 cell corpses are present and can be easily visualized 
using Normaski optics. cad''4 mutations prevent cell 
death and the appearance of these corpses. Thus, 
suppression of the Ced-4 mutant phenotype by a wild- 
type ced-4 gene can be observed and readily quantified 

10 in a ced-l mutant background based on an increase in 
the number of visible cell corpses. 

From one such microinjection experiment, three 
non-Unc animals rescued for the Unc-3l mutant phenotype 
were picked from among the Fl progeny, and from one of 

15 them a line of non-Unc transf ormants was obtained. No 
true-breeding non-Unc animals could be isolated from 
this line: about 25% of the progeny of all non-Unc , 
animals were Unc. Since no inviable zygotes were 
observed among the progeny of these non-Unc animals, 

20 this transf ormant did not carry a recessive lethal 

insertion mutation. Rather, it seems likely that the 
injected DNA was maintained as an extrachromosomal 
array that was segregated to only some gametes, as has 
been reported previously for many other C. elegans 

25 transgenic strains (e.g., Stinchcomb et al . , Mol . Cell 
Biol. S2:110-156 (1985); Fire, EMBO J . 5:2673-2680 
(1986); Way and Chalfie, Cell 54:5-16 (1988)). This 
putative extrachromosomal array was named nExl. Young 
LI progeny of nZxi -containing animals were examined 

3 0 using Nomarski optics for the Ced-4 phenotype. 

Young LI ced-I animals have an average of 23 cell 
corps.^s in the head, while ced-1 (el735) ; ced-4 (nll62) 
animals have an average of 0,5 cell corpses (Ellis and 
Horvitz, Cell 44:817-829 (1986)). Young LI ced-I; 

35 cBd-4 (nll62) ; nExl animals had an average of nine cell 
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corpses in the head. These results indicate that 
cosmid C10D8 restored significant, but not total, 
ced-4^-f; activity in the transf ormants. 

To delineate the ced-4 gene within C10D8, various . 
5 subclones of C10D8 were injected into ced-4 mutant 
animals and tested for their ability to rescue the 
Ced-4 mutant phenotype (Table 1) • The smallest 
subclone plasmid that could rescue the ced-4 phenotype 
as effectively as cosmid C10D8 was a 4.4 kb fragment, 

10 called C10D8-5. C10D8-5 and the U23c-31 -containing 
cosmid C14G10 were coinjected into ced-I; unc-31; 
ced'4 (nll62) animals. Two lines' of non-Unc 
transf ormants were isolated. Since these animals 
continued to segregate Unc animals and did not produce 

15 inviable zygotes, both appeared to carry 

extrachromosomal arrays, which were designated nEx7 and 
nExB. Young LI animals from these transf ormant strains 
had an average of 11.5 cell corpses in their heads, 
indicating that plasmid C10D8-5 restored ced-4 

2 0 activity as well as did cosmid C10D8 (Table 1) . 

Identification of a ced-4 Transcript 

Restriction sites of plasmid C10D8-5 (which can 
rescue the Cfed-4 phenotype) and pnl416 (which contains 
sequences adjacent to the Tc4 insertion site) were 
25 mapped. C10D8 05 was found to overlap with 2 kb of 
sequence in pnl416, including the Tc4 insertion site 
(Figure 8) . 

In Northern blot experiments, both pnl416 and 
C10D8-5 were used to probe poly (A) RNA populations of 
30 mixed developmental stages of wild-type (strain N2) , 
ced-4 (nl416) , and ced-4 (nl416 nl712) and ced-4 (nl416 
nI713)" revertant animals. pnl416 hybridized to a 2 . 2 
kb transcript and an 0.9 kb transcript in RNA from N2 
animals, and a 3 kb transcript, a transcript slightly 
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larger than the wild-type 2.2 Xfa transcript, and a 
transcript slightly smaller than the wild-type 0.9 kb 
transcript in ced-4 (nl416) animals. The 3,8 kb RNA 
contained Tc4 sequence (see below) , suggesting that 
5 this RNA resulted from the insertion of the 1.6 kb Tc4 
sequence into the ced-4 sequence encoding 2.2 kb 
transcript. The transcript slightly larger than the 
2.2 kb wild- type transcript did not contain Tc4 
sequence. This ced-4 (nl416) RNA might have been an 

10 aberrant transcript containing sequences adjacent to 
the ced-4 gene: when pnl415 was used as a probe, the 
wild-type 2.2 kb and the slightly larger transcript in 
this mutant were relatively similar in intensities, 
whereas when ced-4 cDNA clone SK2-1 was used as a 

15 probe, this mutant transcript was not detected (see 
below) . These observations indicate that the 
ced-4 {nl416) 2.2 kb transcript contains sequences from 
the ced-4 region but does not contain sequences 
corresponding to at least the 3' half of the ced-4 

20 mRNA. The two revertants of ced-4 (nl416) , ced-4 (nl416 
nl712) and ced-4 (nl416 nl713) , contained both 2.2 kb 
and 0.9 kb transcripts with similar sizes to the wild- 
type transcripts. Thus, both the 2.2 kb and the 0.9 kb 
transcripts were altered in ced-4 (nl416) animals, and 

25 both were restored in the two non-Ced revertants. 

To determine if any of the transcripts contains 
Tc4 sequence, the Northern blots were probed with 
Tc4-nl35I, which contains the 1.6 kb Tc4 element 
present in the Tc4-induced mutant unc-86 (nl351) as well 

3 0 as 4 kb of unc-86 sequences. Tc4-J2l35I hybridized both 
to a 3.8 kb transcript of the Tc4-induced mutant 
ced-4 (nl416) and to a 1.5 kb unc-68 transcript in both 
ced-4 (nl416) and N2 animals. 

To determine whether one or both of the 2 . 2 kb and 

35 0.9 kb transcripts are encoded by ced-4, subclone 
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C10D8-5, Which rescued the Ced-4 phenotype, was used to 
probe the Northern blots. C10D8-5 detected the wild- 
type 2.2 kb transcript, the ced-4 (nl416} transcript 
slightly larger than the 2,2 kb transcript, and the 
5 ced-4 (nl416) 3.8 kb transcript • C10D8-5 did not 
hybridize to the 0.9 kb transcript, indicating that 
this transcript is unlikely to be encoded by ced-4. 
C10D8-5 also detected a 1.4 kb transcript, which was 
not altered by the Tc4 insertion in ced-4 (nl416) 

10 animals. Only a 470 bp ^coRI-StuI fragment at one end 
of C10D8-5 hybridized to this 1.4 kb RNA. Since 
C10D8-5 did not contain the complete coding region for 
this RNA; and since this RNA was unaffected in 
ced-4 (nl416) animals, this 1.4 kb RNA seems unlikely to 

15 be a ced-4 transcript. The relationships among cosmid 
C10D8-5, pnl416 and the 0.9 kb, 1.4 kb and 2.2 kb 
transcripts are summarized in Figure 8. 

On Northern blots probed with the ced-4 cDNA clone 
SK2-1, the level of the 2.2 kb transcript showed 

20 significant reduction in all three independently 

derived EMS-induced ced-4 mutants examined, strongly 
supporting the hypothesis that this 2 . 2 kb transcript 
is a ced-4 transcript. Total RNA from N2, 
ced-4 (nll62) / ced-4 (nl416) , ced-4 (nl894 ) and 

25 ced-4 (nl920) eggs was probed with ^^P-labelled ced-4 
cDNA SK2-1. An act in 1 probe (Krause and Hirsh, in: 
Molecular Biology of the Cytoskeleton , Borisy at ai . 
(eds.), Cold Spring Harbor Laboratory, 1984, pp. 287- 
292) was used as an internal control for the amount of 

3 0 RNA loaded in each lane. The ratios of the intensity 
of the ced-4 band to that of actin band in N2 , nll62^ 
nl416 and nl894 were 0.5, 0.17, 0 and 0.12, 
respectively. A Northern blot of poly (A) + RNA from 
stage-synchronized animals was probed with pnl4l6, 

35 which hybridizes both to the 2 . 2 kb ced-4 transcript 
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and to a 0.9 kb transcript. The 0.9 kb transcript 
seems to be expressed mostly in eggs and adults. The 
presence of RNA in all lanes was confirmed by loading 
1/10 of each sample on another gel and probing a 
5 Northern blot from this gel using the C. elegans act in 
1 gene (Krause and Hirsh, 1984 supra) . That all of 
these distinct ced~4 mutations cause reduced levels of 
a ced-4 transcript could reflect either instability of 
all three mutant transcripts or a role for ced-4 in 

10 regulating its own expression. 

Based upon these results , it can be concluded that 
the 2.2 kb RNA is a ced-4 transcript. It is not known 
why the 0.9 kb RNA is also altered in ced-4 (nl416) 
animals. Perhaps transcription of the 0.9 kb RNA is 

15 initiated incorrectly as a consequence of the nearby 
Tc4 element. 

ced-4 Expression is Primarily Embryonic 

A Northern blot containing RNAs from stage- 
synchronized animals of different developmental stages 

20 probed with pnl416 showed that the 2.2 kb ced-4 
transcript was expressed primarily during 
embryogenesis. This result is consistent with the 
observation that 113 of the 131 programmed cell deaths 
in the C. elegans hermaphrodite are embryonic (Sulston 

25 and Horvit2, Dev. Biol. 82:110-156 (1977); Sulston et 
al., Dev. Biol. 100:64-119 (1983)). The 2.2 kb RNA was 
relatively abundant during embryonic development. The 
0.9 kb transcript was expressed mostly in eggs and 
adults. The presence of RNA in all lanes was confirmed 

3 0 by loading 1/10 of each sample on another gel and 
probing a Northern blot from this gel with the C, 
elegans actin 1 gene (Krause and Hirsh, 1984 supra) . 
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The ced-4 Transcript is Present in a ced- 3 Mutant 

The activities of both ced-3 and ced-4 are 
required for programmed cell death (Ellis and HorvitZ; 
Cell 44:817-819 (1986)). One possibility is that one 
5 of these genes positively regulates the expression of 
the other. For this reason, a Northern blot of wild- 
type strain N2 and ced-3(n717) poly (A) RNA was probed 
with pnl416. This experiment showed that the 2 . 2 kb 
ced-4 transcript was present at an apparently normal 
10 level in this ced-3 mutant. Thus, the activity of the 
ced-3 gene is unlikely to be necessary for the 
expression of the cad-4 2,2 kb transcript. 

Identification of ced-4 cDNA Clones 

To isolate cDNA clones of ced-4, pnl416 was used 

15 to probe a C. elsgans cDNA phage library made from 
wild-type strain N2 mixed-stage RNA (Kim and Horvitz, 
Genes & i3ev. 4 : 357-371 (1990)). Two cDNA clones were 
isolated. The two cDNA clones (named SKI and SK2) 
hybridized to the 2.2 kb ced-4 transcript. Both are 

20 about 1.8 kb in size, and both contain one 0.8 kb and 
one 1.0 kb EcoRl fragment. These EcoRl fragments were 
subcloned into plasmid vector Bluescribe M13+ 
(Stratagene) The two subclones derived from SKI were 
named SKl-1 and SKl-2, and the two subclones derived 

25 from SK2 were named SK2-1 and SK2-2. The restriction 
maps of the SKI- and SK2 -derived clones were the same. 
Sequence analysis of the ends of the four cDNA 
subclones confirmed the equivalence of the SKI and SK2 
clones, except that SKl-2 contains a poly (A) sequence 

3 0 of more than 50 bp at its 5' end. This poly (A) 

sequence is probably a cDNA cloning artifact, since 
SKl-2 'contains the 5' half of the cDNA (see below). 
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The ced-4 Sequence 

The DNA sequence of the SK2 1.8 kb cDNA clone was 
determined. This sequence includes an open reading 
frame encoding 546 amino acids (Figure 1; Seq. ID #2) , 

5 which is consistent with the results of Northern blot 
analysis using single-stranded KNA probes. An ochre 
termination codon (TAA) is located in-frame near the 3' 
end, indicating that the 3' end of the 2.2 kb 
transcript is most likely included in this cDNA. The 

10 open reading frame extends to the 5' end of the 1.8 kb 
cDNA, suggesting that this cDNA might lack the 5' end 
of the ced-4 coding region. 

A primer extension experiment was performed to 
determine the ced-4 transcription initiation site(s) 

15 using the primer ATTGGCGATCCTCTCGA (Seq. ID #23) and 
C10D8-5 as template. A major transcriptional 
initiation site was identified at 54 bp before (5' of) 
the beginning of the ced-4 cDNA SK2 and a minor 
initiation site at 54 bp after (3' of) the beginning of 

20 this cDNA (Figure 1) . The first AUG codon after the 
presumptive major start site is located at 9 bp before 
the beginning of the cDNA (Figure 1) . If this site is 
used to initiate protein synthesis, the Ced-4 protein 
would be 549 'cimino acids in length. The first AUG 

25 codon after the presumptive minor start site is located 
at 130 bp after the beginning of the cDNA. If this 
site is used, the Ced-4 protein would be 503 amino 
acids in length. Preliminary results using an anti- 
Ced-4 antibody raised against a Ced-4 fusion protein 

30 showed that endogenous Ced-4 protein is slightly 

smaller in molecular weight than a Ced-4 fusion protein 
of 562 amino acids expressed in E. coll. Thus, most 
Ced-4 ' protein is probably initiated near the start of 
the cDNA and is presumably 549 amino acids in length 

3 5 and 62,977 in relative molecular mass. The direction 
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of the open reading frame is consistent with the 
direction of transcription, as demonstrated by probing 
Northern blots with single-stranded RNA probes- The 
presumptive Ced-4 protein is highly hydrophilic, with a. 
5 pi of 5.12* The longest hydrophobic region is a 
segment of 12 amino acids from residues 382 to 393. 

A Western blot of wild-type strain N2 mixed-stage, 
ced-4 (nl416) mixed-stage, wild-type egg, and 
bacterially expressed protein (pJ76) was probed using 

10 anti-Ced-4 antibody. Ced-4 fusion protein (pJ7 6) was 
made by cloning ced-4 cDNA SK2 into the T7 expression 
vector pET-5a (Rosenberg et al./Gene 55:125-135 
(1987)), so that 546 amino acids of Ced-4 sequence were 
fused to 11 amino acids of T7 gene 10 protein and 5 

15 amino acids of linker sequence. This Ced-4 fusion 
protein is similar in relative molecular mass to the 
endogenous Ced-4 protein, which is present in wildrtype 
(N2) but missing in ced-4 (nl416) animals. The proteins 
phosphorylase b, 97 x 10^; bovine serum albumin, 66 x 

20 10-^ (Hirayama et al., Biochejn. Biophys^ Res. Comm. 

173:639-646 (1990)); and ovalbumin, 43 x 10^, were used 
as molecular weight standards. 

To confirm the DNA sequence obtained from the 
cDNAs and to study the structure of the ced-^ 

25 gene, the sequences of the 4.4 kb cosmid subclone 

C10D8-5, the 3 kb insert pnl416, and the 2 kb Hindlll- 
BamEl fragment that contains the Tc4 insertion in the 
ced-4 (nl416) mutant were determined. Comparison of the 
ced-4 genomic and cDNA sequences revealed that the 

30 ced-4 gene has seven introns of sizes ranging from 44 
bp to 557 bp (Figure 2) . The exon sequences of genomic 
clone C10D8-5 are identical to the sequences of ced-4 
cDNA SKS. Comparison of the Tc4 insertion site m 
ced~4(nl416) DNA with the ced-4 (4-) genomic and cDNA 
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sequences indicated that Tc4 inserted into an exon in 
the ced-4 gene in ced'4 (nl416) animals (Figure 2). 

The DNA sequences of eight EMS-induced ced-4 
alleles were also determined (Table 2) . One of the 
eight, nl948, is a missense mutation. Of the seven 
others, four create stop codons and three are predicted 
to affect splicing of the ced-4 transcript. The 
positions of these mutations are indicated in Figure 2. 
These findings indicate that the phenotypes of these 
mutants (Ellis and Horvitz, Ceil 44:817-829 (1986)) 
result from a complete loss of ced-4 gene function • 
These mutations establish the null phenotype of the 
ced-4 gene, confirming that ced-4 function is not 
essential for viability. 

15 The Ced-4 Protein Has Two Regions Similar to Known 
Calcium-Binding Domains 

By direct inspection, the sequence of the putative 
Ced-4 protein was compared with the consensus sequence 
of the calcium-binding loop of the EF-hand domain 

20 (Tufty and Xretsinger, Science 287:161-171 (1975); 
Kretsinger, Cold Spring Harbor Symp. Quant. Biol. 
52:499-510 (1987); Szebenyi and Moffat, J. Biol. Chem. 
25:8761-8777*^(1986)). Two regions of the Ced-4 protein 
were identified that might bind calcium (Figure 3) . 

25 The EF-hand is a 29 amino acid domain consisting 

of a helix-loop-helix region, with the loop portion 
(residues 10-21) coordinating calcium-binding via the 
side-chain oxygens of serine, threonine, asparagine, 
aspartic acid, glutamine or glutamic acid. These 

3 0 residues occur at five of the vertices of an 

octahedron: X (position 10), Y (12), Z (14), -x (18), 
-Z (21) . EF-hand amino acid sequences vary 
considerably in the residues present in the calcium- 
binding loop (Figure 3), and some EF-hand domains have 
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only one helical region (Kretsinger, 1987 supra) . The 
consensus sequence is shown at the top of Figure 3. 
Positions Y, z, and -X can have any of a niimber of 
amino acids which have oxygen-containing side chains • 
5 Position X is usually aspartic acid, and position -2 is 
usually glutamic acid. 

The sequences of parvalbumins from carp muscle 
(Seq. ID #3; Nockolds et al., Proc. Natl. Acad. Sci. 
USA 59:581-584 (1972)}, the intestinal calcium-binding 
10 protein (ICaBP) (Seq. ID #7-8; Szebenyi et al.. Nature 
254:327-332 (1981)), troponin C (Seq. ID #9-12; Collins 
et al., FE3S Lett. 36:268-272 (1973)) and calmodulin 
(Seq. ID #13; Zimmer et al . , J. Biol. Cham. 253:19,370- 
19,383 (1988); Babu et ai.. Nature 315:37-40 (1985)) 
15 show canonical EF-hands. The hake and ray parvalbumins 
(Seq. ID #4-5; Capony et al. Eur. J. Biochem. 32:97-108 
(1973)); Thatcher and Pechere, Eur. J. Biochem. 75:121- 
132 (1977)), sarcoplasmic calcium-binding protein ' 
(SCBP) from the protochordate Amphloxus (Seq. ID #6; 
20 Takagi et al.. Biochemistry 25: 3585-3592 (1986)), 
trypsinogen (Seq. ID #14; Bode and Schwager, J. Mol . 
Biol. 58:693-717 (1975)), fibrinogen (Seq. ID #15; 
Doolittle, Ann. Rev. Biochem. 53:195-229 (1984); Dang 
et ai., J. Biol. Chem. 250:9713-9719 (1985)), villin 
25 (Seq. ID #16; Hesterberg and Weber, J. Biol. Chem. 

258:365-369 (1983)) and galactose-binding protein (GBP) 
(Seq. ID #17; Vyas et al.. Nature 327:635-638 (1987)) 
show variations from the consensus sequence. GBP does 
not contain the helices of the EF-hand. 
3 0 The potential calcium-binding loops of sequence 1 

and sequence 2 are located at amino acids 77-88 and 
amino acids 292-303 of the Ced-4 protein, respectively 
(Figur*^?^ 3) . In its putative calcium-binding loop, the 
first potential EF-hand-like sequence of the Ced-4 
35 protein has four (positions Y, Z, -X, -Z) of the five 



conserved residues with oxygen-containing side chains 
(shown in bold) , and the fifth position (X) has a 
tyrosine rather than an aspartic acid; tyrosine 
contains oxygen in its side chain. The second 
potential EF-hand-like sequence of the Ced-4 protein 
has three residues (positions Z, -X, -Z) that match the 
consensus sequence, and amino acids with oxygen- 
containing side chains at the other two positions. 
These observations suggest that these two regions of 
the Ced-4 protein might bind calcium. Like the Ced-4 
protein, a number of known calcium-binding proteins, 
such a bovine intestinal calcium-binding protein 
(ICaBP) (Szebenyi and Moffat, 1986 supra), rabbit 
troponin C (Collins et ai . , 1973 supra), trypsinogen 
and villin (Doolittle, 1984 supra; Danget et ai . , 1985 
supra) have only three or four conserved residues at 
these five positions (Figure 3). The EF-hand domains 
in ICaBP and troponin C have been shown by X-ray 
crystallography to bind calcium. 

One major difference between the Ced-4 protein and 
the calcium-binding loop of the EF-hand consensus 
sequence is at position 15. Here, the two Ced-4 
sequences have a histidine and a glutamic acid, 
respectively; whereas most ET-hand-containing proteins 
have a glycine; this glycine has been suggested to be 
important for the turning of the loop (Kretsinger, 1987 
supra) . However, a histidine is present at this 
position in a parvalbumin and an aspartic acid is 
present in another parvalbumin and also in a 
sarcoplasmic calcium-binding protein (Kretsinger, 1987 
supra) (Figure 3). Thus, the presence of histidine or 
glutamic acid at position 15 does not rule out the 
possibility that these regions bind calcium. 

The calcium-binding loop (positions 10-21) of the 
EF-hand is thought to be preceded (positions 1-9) and 



followed by alpha-helical domains (positions 22-29) 
(Kretsinger, 1987 supra) . Since position 3 of Ced-4 
sequence 1 and positions 26 and 28 of Ced-4 sequence 2 
are prolines, these regions might not form alpha- 
helices. However, the known calcium-binding protein 
galactose-binding protein (GBP) has a calcixim-binding 
domain similar to that of the EF-hand (Figure 3) but 
without the two helices; furthermore, position 29 of 
GBP is proline (Vyas et al . , 1987 supra). Thus, the 
Ced-4 protein need not contain such alpha-helical 
calcium-binding domains. 

Based upon these considerations, it seems likely 
that the Ced-4 protein binds calcium or a similar 
divalent cation. 



EXAMPLE 2 

CLONING. SEQUENCING, AND CHARACTERIZATION OF 

THE CED-3 GENE 

MATERIALS AND METHODS 

General Methods and Strains 

The techniques used for the culturing of C. 
elegans were as described by Brenner {Genetics 77:71-94 
(1974)). All strains were grown at 20**C. The wild- 
type parent strains were C. elegans variety Bristol 
strain N2 , Bergerac strain EM1002 (Emmons et al . , Cell 
32:55-65 (1983)), C. briggsae and C. vulgaris (obtained 
from V. Ambros) • The genetic markers used are 
described below. These markers have been described by 
Brenner (1974 supra), and Hodgkin et ai , (In: The 
Nematode Caenorhabditis elegans, Wood and the Community 
of C. elegans Researchers (eds.). Cold Spring Harbor 
Laboratory, 1938, pp 491-584). Genetic nomenclature 
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f ollows the standard system (Horvitz at al. , Mol . Gen . 
Genet. 175:129-133 (1979)). 

LG I: ced-l(el375) ; unc-54(r323) 
LG VI; unc-31(e928) , unc-30 (el91) , ced-3(n717, n718, 
nl040, nll29, nll63, nll64, nll65, nizae, 
nl949, n2426, n2430, n2433) , VLnc-26 (e205) , 
dpy-4 (ell66} 

LG V: egl-l(n986) ; unc-76(e911) 

LG X: dpy-3(e27) 



10 Isolatio n of Additional Alleles of ced-3 

A non-compleaientation screen was designed to 
isolate new alleles of ced-3. Because animals 
heterozygous for ced-3 (n717) in trans to a deficiency 
are viable (Ellis and Horvitz, Cell 44:817-829 (1986)), 

15 animals carrying a complete loss-of-function cec?-3 ^ 
allele generated by mutagenesis were expected to be 
viable in trans to ced-3 Cji 717; , even if the new allele 
was inviable in homozygotes. Fourteen EMS mutagenized 
egrl-I males were mated with ced-3 (n717) unc-26 (e205) ; 

20 egrl-l('j348 7;; dpy-3(e27) hermaphrodites, egl-l was used 
as a marker in this screen. Dominant mutations in 
egrl-1 cause the two hermaphrodite specific neurons, the 
HSNs, to undergo programmed cell death (Trent et al., 
Genetics 104:619-647 (1983)). The HSNs are required 

25 for normal egg-laying, and egl-l (n986) hermaphrodites, 
which lack HSNs, are egg-laying defective (Trent et 
al., 1983 supra). The mutant phenotype of egl-l is 
suppressed in a ced-3; egl-l strain because mutations 
in ced-3 block programmed cell deaths. egi-i males 

30 were mutagenized with EMS and crossed with ced-3 (n717) , 
unc-26 (e205} ; egl-l (n487 ) ; dpY-3(e27). Most cross 
progeny were egg-laying defective because they were 
heterozygous for ced-3 and homozygous for egl-l. Rare 



egg-laying competent animals were picked as candidates 
for carrying new alleles of cecf-3. Four such animals 
were isolated from about 10,000 Fl cross progeny of 
EMS-mutagenized animals. These new mutations were made 
homozygous to confirm that they carried recessive 
mutations of ced-3 . 

Molecul ar Biolnrry 

Standard techniques of molecular biology were used 
(Maniatis et ai.. Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory, 1983). 

Two cosmid libraries were used extensively in this 
work: a SauSAI partial digest genomic library of 7000 
clones in the vector pHC79 and a Sau3AI partial digest 
genomic library of 6000 clones in the vector pJB8 (Ish- 
Horowicz and Burke, Nucleic Acids Res. P:2989 (1981)). 

The "right" end of MMM-Cl was cloned by cutting it 
with Hindlll and self-ligating. The "left" end of • 
MMM-Cl was cloned by cutting it with Sglll or 5ail and 
self-ligating. 

The "right" end of JcS was made by digesting Jc8 
with EcoRl and self-ligating. The "left" end of JcS 
was made by digesting JcS by 5aII and self-ligating. 

C. elegans RNA was extracted using guanidine 
isothiocyanate (Kim and Horvitz, Genes & Dev. 4:357-371 
(1990)). Poly(A)+ RNA was selected from total RNA by a 
poly(dT) column (Maniatis et al., 1983 supra). To 
prepare stage-synchronized animals, worms were 
synchronized at different developmental stages (Meyer 
and Casson, Genetics 106:29-44 (1986)). 

For DNA sequencing, serial deletions were made 
according to a procedure developed by Henikoff {Gene 
2S:35r-'359 (1984)). DNA sequences were determined 
using Sequenase and protocols obtained from US 
Biochemicals with minor modifications. 



•46- 

The Tel DNA probe for Southern blots was pCe2001; 
which contains a Bergerac Tel element (Emmons et ai.. 
Cell 32:55-65 (1983))- Enzymes were purchased from New 
England Biolabs, and radioactive nucleotides were from - 
5 Amer sham ♦ 

Primer extension procedures followed the protocol 
by Robert E. Kingston (In: Current Protocols in 
Molecular Biology , Ausubel et al . (edsO# Greene 
Publishing Associates and Wiley-Interscience, New York, 
10 p. 4.8,1) with minor modifications. 

Polymerase chain reaction (PGR) was carried out 
using standard protocols supplied by the GeneAmp Kit 
(Perkin Elmer) . The primers used for primer extension 
and PGR are as follows: 



15 Pex2: 5' TCATCGACTTTTAGATGAGTAGAGAACATC 3' 

(Seq. ID #24) ; 
Pexl: 5' GTTGCACTGCTTTCACGATCTCCCGTCTCT 2' 

(Seq. ID #25) ; 
SLl: 5' GTTTAATTACCCAAGTTTGAG 3' (Seq. ID #26); 

2 0 SL2: 5' GGTTTTAACCAGTTACTCAAG 2' (Seq. ID #27); 

Logs: 5' CCGGTGACATTGGACACTC 3' (Seq. ID #28); and 

OligolO: 5' ACTATTCAACACTTG 3' (Seq. ID #29). 

Germline Transf oriaation 

The procedure for microinjection basically follows 
25 that of A. Fire {EMBO J. 5:2673-2680 (1986)) with 

modifications: Cosmid DNA was twice purified by CsCl- 
gradient. Miniprep DNA was used when deleted cosmids 
were injected. To prepare miniprep DNA, DNA from 1.5 
ml overnight bacterial culture in superbroth (12 g 

3 0 Bacto-tryptone, 24 g yeast extract, 8 ml 50% glycerol, 

900 ml H2O, autoclaved; after autoclaving, 100 ml 0.17 
M XH2PO4 and 0.72 M KH2PO4 were added) was extracted by 
alkaline lysis method as described in Maniatis et al. 
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(1983 supra). DNA was treated with RNase A (37°, 30 
minutes) and then with protease K (55°, 30 minutes), 
extracted with phenol and then chloroform, precipitated 
twice (first in 0.3 M sodium acetate and second in 0,1 . 
5 M potassium acetate, pH 7.2), and resuspended in 5 Ml 
injection buffer as described by A. Fire (1986 supra) . 
The DNA concentration for injection is in the range of 
100 ug to 1 mg per ml. 

All transformation experiments used ced-l (el735) j 

10 unc'31(e928) cad'3 (n717) strain. unc-5I was used as a 
marker for co-transformation (Kim and Horvitz, 1990 
supra) . ced-I was present to facilitate scoring of the 
ced-3 phenotype. The mutations in ced-1 block the 
engulf ment process of cell death, which makes the 

15 corpses of the dead cells persist much longer than in 
wild-type animals (Hedgecock et al., Science 220:1277- 
1280 (1983)). The ced-3 phenotype was scored as the 
number of dead cells present in the head of young LI 
animals. The cosmid C10D8 or the plasmid subclones of 

20 ClODS were mixed with C14G10 (unc-31 (+) -containing) at 
a ratio of 2:1 or 3:1 to increase the chances that a 
Unc-31(4-) 'transformant would contain the cosmid or 
plasmid being tested as well. Usually, 20-30 animals 
were injected in one experiment. Non-Unc Fl progeny of 

25 the injected animal were isolated three to four days 
later. About 1/2 to 1/3 of the non-Unc progeny 
transmitted the non-Unc phenotype to F2 progeny and 
established a transformant line. The young LI progeny 
of such non-Unc transformant were checked for the 

3 0 number of dead cells present in the head using Nomarski 
optics . 



-48- 



RESULTS 

Isolation of Additional ced-3 Alleles 

All of the ced-3 alleles that existed previously 
were isolated in screens designed to detect viable 
5 mutants displaying the Ced phenotype (Ellis and 

Horvitz, Cell 44:817-829 (1986)). Such screens may 
have systematically missed any class of ced-3 mutations 
that is inviable as homozygotes. For this reason, a 
scheme was designed that could isolate recessive lethal 

10 alleles of ced-3. Four new alleles of CQd~3 (nll63^ 
nll64, nll65, nl2B6) were isolated in this way. Since 
new alleles were isolated at a frequency of about 1 in 
2500, close to the frequency expected for the 
generation of null mutations by EMS in an average C. 

15 Blegans gene (Brenner, Genetics 77:71-94 (1974); 

Greenwald and Horvitz, Genetics 95:147-160 (1980)), and 
all four alleles are homozygous viable, it was 
concluded that the null allele of ced-3 is viable • 

Mapping RFLPs near ced-3 

20 Tel is a C. elegra72s transposable element that is 

thought to be immobile in the common laboratory Bristol 
strain and in the Bergerac strain (Emmons et al,, Cell 
32:55-65 (1983)). In the Bristol strain, there are 30 
copies of Tel, while in the Bergerac strain, there are 

25 more than 400 copies of Tel (Emmons et al., 1983 supra; 
Finney, Ph.D. thesis, Massachusetts institute of 
Technology, Cambridge, Massachusetts, 1987) . Because 
the size of the C. elegans genome is small (haploid 
genome size 8 x lo"^ bp) (Sulston and Brenner, Genetics 

30 77:95-104 (1976)), a polymorphism due to Tel between 
the Bristol and Bergerac strains would be expected to 
occur about once every 2 00 kb. Restriction fragment 
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length polymorphisms (RFLPs) can be used as genetic 
markers and mapped in a manner identical to 
conventional mutant phenotypes. A general scheme has 
been designed to map Tel elements that are dimorphic 
5 between the Bristol and Bergerac strains near any gene 
of interest (Ruvkun et al., Genetics, 121:501-516 
(1989)). Once tight linkage of a particular Tel to a 
gene of interest has been established, that Tel can be 
cloned and used to initiate chromosome walking. 

10 A 5.1 kb Bristol-specific Tel EcoRI fragment was 

tentatively identified as containing the Tel closest to 
ced-3. This Tel fragment was cloned using eosmids from 
a set of Tel-containing C. elegans Bristol genomic DNA 
fragments. DNA was prepared from 4 6 such TCl- 

15 containing eosmids, and this DNA was screened using 
Southern blots to identify the eosmids that contain a 
5.1 kb EcoRl Tel-containing fragment. Two such eosmids 
were identified: MMM-Cl and MMM-C9. The 5.1 kb i:coRI 
fragment was subcloned from MMM-Cl into pUC13 

20 (Promega) . Since both ends of Tel contain an EcoRW 
site (Rosenzweig et al.. Nucleic Acids Res. Ii:420l- 
4209 (1983)), i:coRV was used to remove Tel from the 5.1 
kb EcoRI fragment, generating a plasmid that contains 
only the unique flanking region of this Tel-containing 

25 fragment. This plasmid was then used to map the 
specific Tel without the interference of other Tel 
elements . 

unc-30Cel52; ced-3 (n717) dpy-4 (ell66) /+++ males 
were crossed with Bergerac {EM1002) hermaphrodites, and 

30 Unc non-Dpy or Dpy non-Une recombinants were picked 
from among the F2 progeny. The recombinants were 
allowed to self-f ertilize, and strains that were 
homozy^dus for either unc~30 (el91) dpy-4 (Bergerac) or 
unc-30 (Bergerac) dpy-4 (ell66) were isolated. After 

35 identifying the ced genotypes of these recombinant 
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strains, DNA was prepared from these strains, A 
Southern blot of DNA from these recombinants was probed 
with the flanking sequence of the 5.1 kb EcoRI Tel 
fragment. This probe detects a 5.1 kb fragment in 
5 Bristol N2 and a 3.4 kb fragment in Bergerac. Five out 
of five unc'30 ced-3 dpy(^Berg) recombinants, and one 
of one unc-30 (-i-Berg} ced-3 dpy-4 recombinants showed 
the Bristol pattern. Nine of ten unc~30 (-^Berg) dpy-4 
recombinants showed the Bergerac pattern. Only one 

10 recombinant of unc''3 0 (-^Berg) dpy-4 resulted from a 
cross-over between ced-3 and the 5.1 kb Tel element. 
The genetic distance between ced-3 and dpy-4 is 2 map 
units (mu) . Thus, this Tel element is located 0.1 mu 
on the right side of ced-3. 

15 Cosmids MMM-Cl and MMM-C9 were used to test 

whether any previously mapped genomic DNA cosmids 
overlapped with these two cosmids. A contig of 
overlapping cosmids was identified that extended the 
cloned region near ced-3 in one direction. 

2 0 To orient MMM-Cl with respect to this contig, both 

ends of MMM-Cl were subcloned and these subclones were 
used to probe the nearest neighboring cosmid C48D1. 
The "right" end of MMM-Cl does not hybridize to C48D1, 
while the "left" end does. Therefore, the "right" end 
25 of MMM-Cl extends further away from the contig. To 

extend this contig, the "right" end of MMM-Cl was used 
to probe the filters of two cosmid libraries (Coulson 
et ai., Free. Natl. Acad. 5ci . USA 83:7821-7325 
(1986)). One clone, Jc8, was found to extend MMM-Cl in 

3 0 the opposite direction of the contig. 

RFLPs between the Bergerac and Bristol strains 
were used to orient the contig with respect to the 
genetic map. Bristol (N2) and Bergerac (EM1002) DNA 
was digested with various restriction enzymes and 
35 probed with different cosmids to look for RFLPs. Once 
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such an RFLP was found, DNA from recombinants of the 
Bristol and Bergerac strains between cecJ-3 and unc-26, 
and between unc-30 and ced-3 was used to determine the 
position of the RFLP with respect to ced-3, 
5 The "right" end of Jc8, which represents one end 

of the contig, detects an RFLP (nP33) when N2 and 
EM1002 DNA was digested with ITiijdlll, A Southern blot 
of DNA from recombinants between three ced-3 (-hBerg) 
unc-26 was probed with the "right" end of Jc8 . Three 

10 of three +Bergr unc-2^ recombinants showed the Bristol 
pattern, while two of two ced-3 unc~2 6 (-{-Berg} 
recombinants showed the Bergerac pattern. Thus, nP33 
mapped very close or to the right side of unc-2 5. 

The "left" end of Jc8 also detects a Hindlll RFLP 

15 (nP34) . The same Southern blot was reprobed with the 
Jc8 "left" end. Two of the two ced-3 unc-26 (+Berg) 
recombinants and two of the three CBd-'3 (^Berg) unc-26 
recombinants showed the Bergerac pattern. One of the 
three ced-3 f +Bergr; unc-26 recombinants showed the 

2 0 Bristol pattern. The genetic distance between ced-3 

and unc-26 is 0.2 mu. Thus, nP34 was mapped between 
ced-3 and unc-25, about 0.1 mu on the right side of 
ced-3 . 

The flanking sequence of the 5.1 3cb EcoRl Tel 
25 fragment (named nP35) was used to probe the same set of 
recombinants. Two of three ced-3 (-hBerg) unc~26 
recombinants and two of two ced-3 unc-2 6 (-hBerg) 
recombinants showed the Bristol pattern. Thus, nP35 
was also found to be located between CQd-3 and unc-2 6, 

3 0 about 0.1 mu on the right side of ced-3. 

A similar analysis using cosmid T10H5 which 
contains the Hi-ndlll RFLP (nP36) , and cosmid B0564, 
which 'cbntains a HinaiTl RFLP {nP37) , shoved that nP36 
and nP37 mapped very close or to the right of unc-30. 
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These experiments localized the ced-3 gene to an 
interval of three cosmids. The positions of the RFLPs, 
and of ced-3, unc-30 and unc-26 on chromosome IV, and 
their relationships to the cosmids are shown in Figure 
9, It was has been further demonstrated by 
microinjection that cosmids C37G8 and C3 3F2 carry the 
unC'SO gene (John Sulston, personnel communication) . 
Thus, the region containing the ced-3 gene was limited 
to an interval of two cosmids. These results are 
summarized in Figure 9. 

Complementation of ce d-3 by Gennline Transformation 

Cosmids that were candidates for containing the 
ced-3 gene were microinjected into a ced-3 mutant to 
see if they rescue the mutant phenotype. The procedure 
for microinjection was that of A. Fire (EMBO J. 5:2673- 
2680 (1986)) with modifications, unc-31, a mutant 
defective in locomotion, was used as a marker for • 
cotrans formation (Kim and Horvitz, Genes & Dev. 4:357- 
371 (1990)), because the phenotype of ced-3 can be 
examined only by using Nomarski optics, Cosmid C14G10 
(containing unc-31(+)) and a candidate cosmid were 
coinjected into ced-1 (el375) ; unc-31(e928) ced-3 (n717) 
hermaphrodites, and Fl non-Unc transf ormants were 
isolated to see if the non-Unc phenotype could be 
transmitted and established as a line of transf ormants . 
Young LI progeny of such transf ormants were examined 
for the presence of cell deaths using Nomarski optics - 
to see whether the ced-3 phenotype was suppressed. 
Cosmid C14G10 containing unc-31 alone does not rescue 
ced-3 activity when injected into a ced-3 mutant. 
Table 4 summarizes the results of these transformation 
experiments . 

As shown in Table 4, of the three cosmids injected 
(C43C9, W07H6 and C48D1) , only C48D1 rescued the ced-3 
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phenotype (2/2 non-Unc trans formants rescued the ced-3 
phenotype) . One of the transf ormants, nEX2 , appears to 
be rescued by an extra-chromosomal array of injected 
cosmids (Way and Chalfie, Ceil 54:5-16 (1988)), which 
5 is maintained as an unstable duplication, since only 
50% of the progeny of a non-Unc Ced(+) animal are 
non-Unc Ced(4-), Since the non-Unc Ced(+) phenotype of 
the other transformant (nISl) is transmitted to all of 
its progeny, it is presumably an integrated 

10 transformant. LI ced-I animals contain an average of 
23 cell corpses in the head (Table 5). LI ced-I ; ced-3 
animals contain an average of 0,3 cell corpses in the 
head.- ced-I; unc-3I ced-3; nJSI and ced-I; unc-31 
ced-3; nSX2 animals contain an average of 16.4 and 14.5 

15 cell corpses in the head, respectively. From these 

results, it was concluded that C48D1 contains the ced-3 
gene. 

In order to locate ced-3 more precisely within the 
cosmid C48Dr, this cosmid was subcloned and the 

2 0 subclones were tested for the ability to rescue ced-3 

mutants (Table 5) . C48D1 DNA was digested with 
restriction enzymes that cut rarely within the cosmid 
and the remaining cosmid was self-ligated to generate a 
subclone. Such subclones were then injected into a 
25 ced-3 mutant to look for complementation; young LI 
non-Unc progeny of the transformant s were examined 
using Nomarski optics for the presence of cell death in 
the head. When C48D1 was digested with BamEI and self-' 
ligated, the remaining 14 kb subclone (named C48D1-28) 

3 0 was found to rescue the ced-3 phenotype when injected 

into a ced-3 mutant (Figure 10 and Table 5) . C48D1-28 
was then partially digested with Sgrlll and self- 
ligat^d. Clones of various lengths were isolated and 
tested for their ability to rescue ced-3. 
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One clone, C48D1-43, which did not contain a 1,7 
3cb Bglll fragment of C48D1-28, was able to rescue ced-3 
(Figure 10 and Table 5) . C48D1-43 was further 
subcloned by digesting with BamRl and Apal to isolate a - 
5 10 kb BainHI-Apal fragment. This fragment was subcloned 
into pBSKII+ to generate pJ4 0. pJ4 0 can restore ced-3+ 
phenotype when micro injected into a ced-3 mutant. pJ4 0 
was subcloned by deleting a 2 kb Bglll-Apal fragment to 
generate pJ107. pJlO? was also able to rescue the 

10 ced-3 phenotype when microinjected into a ced-3 mutant. 
Deletion of 0.5 kb on the left side of pJ107 could be 
made by Exolll digestion (as in pJ107del28 and 
pJ107del34) without affecting CBd~3 activity; in fact, 
one transgenic line, nEXll , restores full ced-3 

15 activity. However, the ced-3 rescuing ability was 

significantly reduced when 1 kb was deleted on the left 
side of pJ107 (as in pJ107dell2 and pJ107del27), and 
the ability was completely eliminated when a 1.8 kb 
5alI-BgIII fragment was deleted on the right side of 

20 pJ107 (as in pJ55 and pJ56) , suggesting that this Sail 
site is likely to be in the ced-3 coding region. From 
these experiments, ced-3 was localized to a DNA 
fragment of 7.5 kb. These results are summarized in 
Figure 10 and Table 5. 

25 ced-3 Transcript 

pJ107 was used to probe a Northern blot of N2 RNA 
and detected a band of 2.8 kb. Although this 
transcript is present in 12 ced-3 mutant animals, 
subsequent analysis showed that all 12 ced-3 mutant 

3 0 alleles contain mutations in the genomic DNA that codes 
for this mRNA (see below) , thus establishing this RNA 
as a ced-3 transcript. 

The developmental expression pattern of ced-3 was 
determined by hybridizing a Northern blot of RNA from 
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animals of different stages (eggs, LI through L4 larvae 
and young adult) with the ced-3 cDNA subclone pJllS. 
Such analysis revealed that the ced-3 transcript is 
most abundant during esibryonic development, which is 
5 the period when most programmed cell deaths occur, but 
it was also detected during the LI through L4 larval 
stages and is present in relatively high levels in 
young adults • This result suggests that ced-3 is not 
only expressed in cells undergoing programmed cell 
10 death. 

Since ced-3 and ced-4 are both required for 
programmed cell death in C. elegans , one of the genes 
might' act as a regulator of transcription of the other 
gene. To examine if ced-4 regulates the transcription 

15 of ced-3, RNA was prepared from eggs of ced-4 mutants 
(211162^ nl4l6, nl894, and nl920) , and a Northern blot 
was probed with the ced-3 cDNA subclone pJllS • The 
presence of RNA in each lane was confirmed with an' 
actin I probe. Such an experiment showed that the 

20 level of ced-3 transcript is normal in ced-4 mutants. 
This indicates that ced-4 is unlikely to be a 
transcriptional regulator of ced-3 . 

Isolation of -^a ced-3 cDNA 

To isolate cDNA of CBd-3 , pJ4 0 was used as a probe 

25 to screen a cDNA library of N2 (Kim and Horvitz, Genes 
& Dev. 4:357-371 (1990)). Seven cDNA clones were 
isolated. These cDNAs can be divided into two groups: 
one is 3.5 kb and the other 2.5 kb. One cDNA from each 
group was subcloned and analyzed further. pJS5 

3 0 contains the 3.5 kb cDNA. Experiments showed that pJ8 5 
contains a ced-3 cDNA fused to an unrelated cDNA; on 
Northerni blots of N2 RNA, the pJ85 insert hybridizes to 
two RNA transcripts, and on Southern blots of N2 DNA, 
pJ85 hybridizes to more than one band than pJ40 (ced-3 
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genomic DNA) does. pJ87 contains the 2.5 kb cDNA. On 
Northern blots, pJ87 hybridizes to a 2.8 kb RNA and on 
Southern blots, it hybridizes only to bands to which 
pJ4 0 hybridizes. Thus, pJ87 contains only ced-3 cDNA. 
5 To show that pJ87 does contain the ced-3 cDNA, a 

frameshift mutation was made in the Sail site of pJ40 
corresponding to the Sail site in the pJ87 cDNA. 
Constructs containing the frameshift mutation failed to 
rescue the ced-3 phenotype when microinjected into 
10 ced-3 mutant animals, suggesting that ced-3 activity 
has been eliminated, 

ced-3 Sequence 

The DNA sequence of pJ87 was determined (see 
Figure 4; Seq. ID #18), pJ87 contains an insert of 2.5 

15 kb which has an open reading frame of 503 amino acids 
(Figure 4; Seq. ID #19). The 5' end of the cDNA 
contains 25 bp of poly-A/T sequence, which is probably 
an artifact of cloning and is not present in the 
genomic sequence. The cDNA ends with a poly-A 

20 sequence, suggesting that it contains the complete 3' 
end of the transcript. l kb of pJ87 insert is 
untranslated 3' region and not all of it is essential 
for ced-3 expression, since genomic constructs with 
deletions of 380 bp of the 3' end can still rescue 

25 ced-3 mutants (pJl07 and its derivatives, see Figure 
10) . 

To confirm the DNA sequence obtained from the 
cad -3 cDNA and to study the structure of the ced-3 
gene, the genomic sequence of the ced-3 gene in the 
30 plasmid pJ107 was determined (Figure 4; Seq. ID #18). 
Comparison of the ced-3 genomic and cDNA sequences 
revealed that the ced-3 gene has seven introns that 
range in size from 54 bp to 1195 bp (Figure 5A) . The 
four largest introns, as well as sequences 5' of the 
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start codon, were found to contain repetitive elements • 
Five types of repetitive elements were found, some of 
which have been previously characterized in non-coding 
regions of other C. e lagans genes such as fem-l (Spence 
5 et ai.. Cell 50:981-990 (1990)), lin'12 (J. Yochem, 
personal communication) , and myoD (Krause et al . , Ceil 
63:907-919 (1990)) (Figure 4). Of these, repeat 1 was 
also found in fem-I and myoD, repeat 3 in lin-12 and 
fem-'l, repeat 4 in lin'12 , and repeats 2 and 5 were 

10 novel repetitive elements • 

A combination of primer extension and PGR 
amplification was used to determine the location and 
nature of the 5' end of the ced-3 transcript. Two 
primers (Pexl and Pex2) were used for the primer 

15 extension reaction* The Pexl reaction yielded two 
major bands, whereas the Pex2 reaction gave one band. 
The Pex2 band corresponded in size to the smaller band 
from the Pexl reaction, and agreed in length with a 
possible transcript that is trans-spliced to a C. 

20 elegans splice leader (Bektesh, Genes & Dev., 2;1277- 
1283 (1988)) at a consensus splice acceptor at position 
2166 of the genomic sequence (Figure 4) . The nature of 
the larger Pexl band is unclear. 

To confirm the existence of this trans-spliced 

25 message in wild-type worms, total C. elegans RNA was 
PGR amplified using the SLl-Log5 and SL2-Log5 primer 
pairs, followed by a reamplif ication using the SLl- 
OligolO and SL2-01igolO primer pairs. The SLl reaction' 
yielded a fragment of the predicted length. The 

3 0 identity of this fragment was confirmed by sequencing. 
Thus, at least some, if not most, of the ced-3 
transcript is trans-spliced to SLl. Based on this 
result;-'the start codon of the ced-3 message was 
assigned to the methionine encoded at position 2232 of 

3 5 the genomic sequence (Figure 4) . 



The DNA sequences of 12 EMS-induced cedf-3 alleles 
were also determined (Figure 4 and Table 3) . Nine of 
the 12 are missense mutations. Two of the 12 are 
nonsense mutations, which might prematurely terminate 
the translation of ced-3. These nonsense ced-3 mutants 
confirmed that the ced-3 gene is not essential for 
viability. One of the 12 mutations is an alteration of 
a conserved splicing acceptor G, and another has a 
change of a 70% conserved C at the splice site, which 
could also generate a stop codon even if the splicing 
is correct. Interestingly, these EMS-induced mutations 
are in either the N-tsrminal quarter or C-terminal half 
of the protein. In fact, 9 of the 12 mutations occur 
within the region of csd-3 that encodes the last 100 
amino acids of the protein. Mutations are notably 
absent from the middle part of the ced-3 gene (Figure 
5). 

Ced-3 Protein C ontains A Region Rich in Serines 

The Ced-3 protein is very hydrophilic and no 
significantly hydrophobic region can be found that 
might be a trans -membrane domain (Figure 6) . The Ced-3 
protein is rich in serine. From amino acid 78 to amino 
acid 205 of the Ced-3 protein, 34 out of 127 amino 
acids are serine. Serine is often the target of 
serine/ threonine protein kinases (Edelman, Ann. Rev. 
Biochem. 55:567-613 (1987)). For example, protein 
kinase C can phosphorylate serines when they are 
flanked on their amino and carboxyl sides by basic 
residues (Edelman, 1987 supra). Four of the serines in 
the Ced-3 protein are flanked by arginines (Figure 4). 
The same serine residues might also be the target of 
related Ser/Thr kinases. 

To identify the functionally important regions of 
the Ced-3 protein, genomic DNAs containing the ced-3 
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genes from two related nematode species, C, briggsae 
and C. vulgaris were cloned and sequenced (Figure 7; 
Seq* ID i^2 0 and 21) . Sequence comparison of the three 
ced-3 genes showed that the non-serine-rich region of 
5 the proteins is highly conserved. In C. briggsae and 
C. vulgaris, many amino acids in the serine-rich region 
are dissimilar compared to the C. elegans Ced-3 protein 
(Figure 7) , It seems that what is important in the 
serine-rich region is the overall serine-rich feature 

10 rather than the exact amino acid sequence. 

This hypothesis is also supported by analysis of 
ced-3 mutations in C. elegans: none of the 12 EMS- 
induced mutations is in the serine-rich region, 
suggesting that mutations in this region might not 

15 affect the function of the Ced-3 protein and thus, 

could not be isolated in the screen for ced-3 mutants. 
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Tabla 1 
Raacue of tha Cad-4 
Phanot-ypa by Garmlina Transformation 



Ganotypa 



No. 

DNA Avg. No. Call Animals 

Injactad Corpses (Ll Head) Scored 



csd-1; ced-4; C10D8; 
unc-31; nSxl C14G10 



9-4 



10 



ced-l; cad-4; 
unc-31; nEx7 



C10D8-5 
C14G10 



11.5 



10 



cad-1; ced-4 
unc-31; nEx8 



C10D8-5 
C14G10 



11.5 



10 



ced-1 



None 



23 



20 



ced-1; ced-4 



None 



0.6 
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Table 4 

8-ummarv of Transformation Ex par^mw'n-ha 
Using Coamida in tha ced-3 Region 



Cosaid 
ini acted 

C43C9; C14G10 

W07H6; C14G10 



C48D1;- C14G10 



No. of non-Unc 
tranaf oraants 

1 

3 



Ced-3 

phenotvoe 



+ 
+ 



Strain name 



MT4302 

MT4299 
MT4300 
MT4301 

MT4298 
MT4303 



Animals injected were of genotype: ced-1 (el735) ; unc-3i(e929) 
ced-3 (n717} . 
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Tabl« 5 

The •zprassioD of ced-3 f+) transformants 



Ganotypa 



DNA injectad 



Xvarag* No. 
call daaths 
in LI haad 



No. 

Animals 
scored 



ced-1 



23 



20 



ced-1; ced-J 



0.3 



10 



ced-1; nISl 
unc-31 ced-3 



C48D1; 
C14G10 



16.4 



20 



ced-1; unc-31 
ced-3; nISl/+ 



14.5 



20 



ced-1; unc-31 
ced-3; nEX2 



C48D1; 
C14G10 



13.2 
0 



10/14 
4/14 



ced-1; unc-31 
ced-3; nEXlO 



C48D1-28; 
C14G10 



12 
0 



9/10 

1 Of 10 



ced-1; unc-31 
ced-3; nEX9 



C48D1-28; 
C14G10 



12 



10 



ced-1; unc-31 
ced-3; nEXll 



C48D1-43 
C14G10 



16.7 



10/13 



Abnormal cell 3/13 
deaths 



ced-1; unc-31 
ced-3; nEX13 



pJ40; C14G10 



13.75 



4/4 



Tab la 5 continuad 



-65- 



ced-1 ; tLnc-3 1 
ced~3; nSX17 



ced-1; unc-Sl 
ced-3; nEX18 



ced-1; unc-31 
ced-3; nEX19 



ced-1; unc-31 
ced-3; nEXie 



pJ107del28, 

pJ107del34 

C14G10 



pJ107del28, 
pJ107dell34 
C14G10 



pJ107del28, 

pJ107del34 

G14G10 



pJ107dell2, 

pJ107del27 

C14G10 



23 



12.8 



10.6 



7.8 



12/14 

2/14 
9/10 

1/10 
5/6 

1/6 
12/12 



Alleles of the genes used are ced-1 (el735) , unc-31 (e928) , and ced- 
3 (n717) . 
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Ecruivalents 

Those skilled in the art will recognize, or be 
able to ascertain using no more than routine 
experimentation, many equivalents to the specific 
5 embodiments of the invention described herein. Such 
equivalents are intended to be encompassed by the 
followiAg claims. For example, functional equivalents 
of DNAs and RNAs may be nucleic acid sequences which, 
through the degeneracy of the genetic code, encode the 

10 same proteins as those specifically claimed. 

Functional equivalents of proteins may be substituted 
or modified amino acid sequences, wherein the 
substitution or modification does not change the 
activity or function of the protein. A "silent" amino 

15 acid substitution, such that a chemically similar amino 
acid (e.g., an acidic amino acid with another acidic 
amino acid) is substituted, is an example of how a 
functional equivalent of a protein can be produced. 
Functional equivalents of nucleic acids or proteins can 

20 also be produced by deletion of nonessential sequences. 
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CLAIM5 



Isolated DNA which is the ced-3 gene. 

Isolated DNA having the nucleotide sequence of 
Figure 4 (Seq. ID #18) . 

Isolated DNA encoding the amino acid sequence of 
Figure 4 (Seq. ID #19). 

Isolated RNA encoded by the DNA of Claim 1. 

Isolated protein encoded by the DNA of Claim 1. 

Isolated protein having the amino acid sequence of 
Figure 4 (Seq. ID #19). 

An antibody directed against the protein of claim 
6. 

Isolated DNA which is a mutated ced-3 or ced-4 
gene "having a mutation which affects the activity 
of the gene. 



The DNA of Claim 8, wherein the mutated ced~4 gene 
is selected from the group consisting of: 

a) nll62 

b) 112274 

c) nl920 

d) n2247 

e) n2273 

f) nl948 

g) ^T nl947; and 

h) nl894. 



The DNA of Claim 8, wherein the mutation in cQd-4 
results in an alteration selected from the group 
consisting of: 

a) Q to termination at codon 40; 

b) R to termination at codon 139; 

c) I to N at codon 258; 

d) Q to termination at codon 262; 

e) W to termination at codon 4 01; and 

f ) an alteration in mRNA splicing resulting from 
a change at nucleotide 6297, 

The DNA of Claim S, wherein the mutation in ced-4 
is selected from the group consisting of: 

a) C to T at nucleotide 1131; 

b) C to T at nucleotide 1428; 

c) G to A at nucleotide 1929; 

d) T to A at nucleotide 2117; 

e) C to T at nucleotide 2128; and 

f) G to A at nucleotide 3131. 

The DNA of Claim 8, wherein the mutated ced-3 gene 
is selected from the group consisting of: 



a) 


nl040j 


b) 


n718; 


c) 


n2433; 


d) 


nil 64; 


e) 


71717; 


f) 


nl949; 


g) 


nl236; 




nll29; 


i) 


nll65} 


j) 


n243 0; 


3c) 


n2426; and 


1) 


nil 63. 
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The DNA of Claim 8 , wherein the mutation in ced-3 
results in an alteration selected from the group 
consisting of: 

a) L to F at codon 27; 

b) G to R at codon 65; 

c) G to S at codon 360; 

d) Q to termination at codon 403; 

e) Q to termination at codon 417; 

f) W to termination at codon 428; 

g) A to V at codon 449; 

h) A to V at codon 466; 

i) E to K at codon 483; 

-j) S to F at codon 486; and 

k) an alteration in mRNA splicing at nucleotide 
6297. 

The DNA of Claim 8, wherein the mutation in ced-3 
is selected from the group consisting of: 



a) 


C 'to 


T 


at 


nucleotide 


2310; 


b) 


G 


to 


A 


at 


nucleotide 


2487; 


c) 


G 


to 


A 


at 


nucleotide 


5757; 


d) 


C 


to 


T 


at 


nucleotide 


5940; 


e) 


G 


to 


A 


at 


nucleotide 


6297; 


f) 


C 


to 


T 


at 


nucleotide 


6322; 


g) 


G 


to 


A 


at 


nucleotide 


6342; 


h) 


C 


to 


T 


at 


nucleotide 


6434; 


i) 


c 


to 


T 


at 


nucleotide 


6485; 


j) 


G 


to 


A 


at 


nucleotide 


6535; 


3c) 


C 


to 


T 


at 


nucleotide 


7020. 



Isolated RNA encoded by the DNA of Claim 8. 



Isolated protein encoded by the DNA of Claim 8* 



Isolated DNA which is a gene selected from the 
group consisting of; 

a) a gene which is structurally related to the 
ced-3 gene; 

b) a gene which is functionally related to the 
ced-3 gene; 

c) a gene which is both structurally and 
functionally related to the ced-3 gene; 

d) a gene which is structurally related to the 
ced-4 gene; 

e) a gene which is functionally related to the 
ced-4 gene; and 

f) a gene which is both structurally and 
functionally related to the ced-4 gene. 

Isolated RNA encoded by the DNA of Claim 17, 

Isolated protein encoded by the DNA of Claim 17. 

An antibody directed against the protein of Claim 
19. 

A probe for identifying a gene which is 
structurally related to the ced-3 gene, said probe 
which is selected from the group consisting of: 

a) DNA having all or a portion of the nucleotide 
sequence of Figure 4 (Seg, ID #18) ; 

b) RNA encoded by the DNA of a) ; 

c) degenerate oligonucleotides derived from a 
portion of the amino acid sequence of Figure 
4 (Seq, ID #19) ; and 

d) an antibody directed against the protein of 
c) • 



A probe for identifying a gene which belongs to 
the same gene family as the ced-3 gene, said probe 
which is selected from the group consisting of: 

a) all or a portion of a gene which is 
structurally related to C8d-3 ; 

b) RNA encoded by a) ; 

c) DNA having the consensus sequence of a 
conserved region between at least two other 
genes which belong to said gene family; 

d) RNA encoded by c) ; 

e) degenerate oligonucleotides derived from a 
portion of the amino acid sequence of a 
protein encoded by a) ; 

f) degenerate oligonucleotides dervied from the 
consensus sequence of a conserved region 
between the proteins encoded by at least two 
other genes which belong to said gene family; 
and 

g) an antibody directed against all or a portion 
of a protein encoded by a) . 

A method for identifying a gene which is 
structurally related to a cell death gene selected 
from ced-3 and CBd-4 , comprising the steps of: 

a) combining DNA with a nucleic acid probe 
comprising said cell death gene, or a portion 
able to specifically hybridize to said cell 
death gene, under conditions suitable for 
specific hybridization of the nucleic acid 
probe to complementary sequences; and 

b) detecting specific hybridization of the 
nucleic acid probe to the DNA, wherein 
specific hybridization indicates that a 
structurally related gene, or portion, is 
present in the DNA, 



thereby identifying a gene which is structurally 
related to a cell death gene selected from ced-3 
and ced-4. 

The method of Claim 23, wherein the DNA is a gene 
library • 

The method of Claim 23, wherein the nucleic acid 
probe further comprises degenerate 
oligonucleotides derived from the amino acid 
sequence of the product of the cell death gene. 

A method for identifying a gene which is 
structurally related to a cell death gene selected 
from ced-3 and ced-4, comprising the steps of: 

1) combining nucleic acid with primers 
comprising portions of said cell death gene 
under conditions suitable for polymerase ' 
chain reaction; and 

2) detecting specific DNA amplification, wherein 
specific DNA amplification produces a 
'structurally related gene, or portion, 

thereby identifying a gene which is structurally 
related .to a cell death gene selected from ced-3 
and ced-4. 

The method of Claim 26, wherein the primers 
further comprise degenerate oligonucleotides 
derived from the amino acid sequence of the 
product of the cell death gene. 
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A method for identifying a gene which is 
structurally related to a cell death gene selected 
from ced-3 and ced-4, comprising the steps of: 

a) combining an expression gene library with an 
antibody directed against the protein encoded 
by said cell death gene under conditions 
suitable for specific antibody-antigen 
binding of the antibody to antigens expressed 
from the gene library; and 

b) detecting specific antibody-antigen binding, 
wherein specific antibody-antigen binding 
indicates that a structurally related gene is 
present in the expression gene library, 

thereby identifying a gene which is structurally 
related to a cell death gene selected from ced-3 
and ced-4. 

A bioassay for identifying a cell death gene, 
comprising the steps of: 

a) using a gene and a nematode selected from a 
nematode having reduced activity of a cell 
death gene and a wild-type nematode to 
produce a transgenic nematode; and 

b) determining in said transgenic nematode an 
increase in cell deaths which occur during 
the development of the nontransgenic 
nematode, wherein an increase in cell deaths 
indicates the activity of a cell death gene, 

thereby identifying a cell death gene. 

The bioassay of Claim 29, wherein the nematode 
underexpresses or expresses an inactivated form of 
a' gene selected from ced-3 and ced-4 . 
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The bioassay of Claim 29, wherein the gene is fron 
an organism other than a nematode. 

The bioassay of Claim 29, wherein the gene is a 
component of an expression gene library. 

Isolated DNA which is a cell death gene identified 
by the bioassay of Claim 29. 

A bioassay to identify a mutation in a cell death 
gene which alters the activity of the gene, 
comprising the steps of: 

a) using a mutated cell death gene and a 
nematode selected from a nematode having 
reduced activity of a cell death gene and a 
wild-type nematode to produce a transgenic 
nematode; and 

b) comparing cell deaths which occur during "the 
development of the transgenic nematode having 
the mutated gene with those which occur in a 
transgenic nematode having a non-mutated 
gene, wherein a difference in cell deaths 
indicates that the mutation alters the 
activity of the cell death gene, 

thereby identifying a mutation in a cell death 
gene which alters the activity of the gene. 

Isolated DNA which is a cell death gene having a 
mutation identified by the bioassay of Claim 34. 

The isolated DNA of Claim 35, wherein the mutation 
has a result selected from the group consisting 
of: 

a) inactivation of the cell death gene; 



b) constitutive activation of the cell death 
gene; and 

c) production of a mutated gene which does not 
cause cell death and which antagonizes the 
activity of functioning cell death genes* 

A bioassay for identifying a gene which affects 
the activity of a cell death gene, comprising the 
steps of: 

a) using a gene and a nematode containing a cell 
death gene to produce a transgenic nematode; 
and 

b) determining in said transgenic nematode a 
difference in cell deaths from cell deaths 
which occur during the development of the 
nontransgenic nematode, wherein a difference 
in cell deaths indicates a gene which affects 
the activity of a cell death gene, 

thereby identifying a gene which affects the 
activity of a cell death gene. 

The bioassay of Claim 37, wherein the cell death 
gene is selected from the group consisting of: 

a) a wild-type gene; 

b) an underexpressed gene; 

c) a gene having reduced activity; 

d) an overexpressed gene; and 

e) a gene having hyperactivity. 

The bioassay of Claim 37, wherein the gene is a 
component of an expression gene library. 

'An isolated gene identified by the bioassay of 
Claim 37. 



A bioassay for identifying an agent which mimics 
the activity of a cell death gene, comprising the 
steps of: 

a) introducing an agent into a nematode selected 
from a nematode having reduced activity of a 
cell death gene and a wild-type nematode; and 

b) detecting an increase in cell deaths which 
occur in the nematode, wherein an increase 
indicates that the agent mimics the activity 
of a cell death gene, 

thereby identifying an agent which mimics the 
activity of a cell death gene. 

The bioassay of Claim 41, wherein the nematode 
underexpresses or expresses an inactivated gene 
selected from ced-3 or ced-4 . 

The bioassay of Claim 42, wherein the agent is 
introduced into the nematode by a method selected 
from: microinjection, diffusion, ingestion and 
shooting in with a particle gun. 

An agent identified by the bioassay of Claim 41, 

A bioassay for identifying an agent which affects 
the activity of a cell death gene, comprising the 
steps of : 

a) introducing an agent into a nematode which 
expresses a cell death gene; and 

b) detecting a change in the pattern of cell 
deaths which occur in the development of the 
nematode, wherein a change indicates that the 
agent affects the activity of the cell death 
gene, 



thereby identifying an agent which affects the 
activity of a cell death gene. 

The bioassay of Claim 45, wherein the nematode 
expresses an endogenous cell death gene or a cell 
death gene which is a transgene. 

The bioassay of Claim 46, wherein the cell death 
gene is ced-3 or ced-4. 

The bioassay of Claim 45, wherein the nematode 
overexpresses or underexpresses the cell death 
gene. 

The bioassay of Claim 45; wherein the nematode 
expresses an inactivated or constitutively 
activated form of the cell death gene. 

The bioassay of Claim 45, wherein the nematode 
underexpresses or expresses an inactivated form of 
a gene selected from ced-3 and ced-4 • 

An agent identified by the bioassay of Claim 45. 

The agent of Claim 47 which is selected from the 
group consisting of: 

a) single stranded nucleic acid having all or a 
portion of the antisense sequence of the cell 
death gene which is complementary to the mRNA 
encoded by the gene; 

b) DNA encoding a) ; and 

c) an antagonist of the cell death gene. 



A method for altering the occurrence of cell 
death, comprising altering in the cell the 
activity of a cell death gene. 

The method of Claim 53, wherein the cell death 
gene is ced-3 or ced-4. 

The method of Claim 53, comprising exposing the 
cell to an agent which alters or mimics the 
activity of a cell death gene in the cell under 
conditions appropriate for activity of the agent. 

The method of Claim 55, wherein the activity of 
the cell death gene is increased, comprising 
exposing the cell to an agent selected from the 
group consisting of: 

a) DNA comprising the cell death gene, or active 
portion thereof; 

b) RNA encoded by the cell death gene, or active 
portion thereof; 

c) protein encoded by the cell death gene, or 
active portion thereof; 

d) an agent which is structurally similar to and 
mimics the activity of the protein encoded by 
the cell death gene; 

e) DNA comprising a constitutively activated 
form of a cell death gene, or active portion 
thereof; 

f ) SNA encoded by the DNA of e) , or active 
portion thereof; 

g) protein encoded by the DNA of e) , or active 
portion thereof; 

h) an agent which is structurally similar to and 
mimics the activity of the protein encoded by 
the DNA of a) ; and 



i) an agonist of the cell death gene, 

under conditions appropriate for the activity of 

the agent* 

The method of Claim 55, wherein the activity of 
the cell death gene is decreased, comprising 
exposing the cell to an agent selected from the 
group consisting of: 

a) single stranded nucleic acid having all or a 
portion of the antisense sequence of the cell 
death gene which is complementary to the mRNA 
of the gene; 

b) DNA which directs the expression of a) ; 

c) a mutated cell death gene which does not 
cause cell death and which antagonizes the 
activity of the cell death gene; 

d) RNA encoded by c) ; 

e) protein encoded by c) ; and 

f) an antagonist of the cell death gene, 
under conditions appropriate for the activity of 
the agent, 

A method for reducing the proliferative capacity 
or size -'of a population of cells, comprising 
increasing the activity of a cell death gene in 
the cells. 

The method of Claim 58, wherein the cells are 
selected from: 

a) cancerous cells; 

b) infected cells; 

c) cells producing autoreactive 
''-^ antibodies; and 

d) hair follicle cells. 



The method of Claim 58, wherein the cell death 
gene is selected from the group consisting of: 

a) ced-3; 

b) a cell death gene which is structurally 
related to ced-3; and 

c) a gene which is functionally related to 
ced-3 . 

The method of Claim 58, wherein the cell death 
gene is selected from the group consisting of: 

a) ced-4; 

b) a cell death gene which is structurally 
related to ced-4; and 

c) a gene which is functionally related to 
ced-4, 

A method for treating a condition characterized by 
cell deaths, comprising decreasing the activity of 
a cell death gene. 

The method of Claim 62, wherein the condition is 
selected from the group consisting of: 

a) myocardial infarction; 

b) stroke; 

c) degenerative disease; 

d) traumatic brain injury; 

e) hypoxia; 

f) pathogenic infection; 

g) aging; and 

h) hair loss. 
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64* A method for treating a parasitic infection of a 
host animal, comprising administering an agent 
which increases the activity of a cell death gene 
specific to the parasite and which does not harm 
5 the host animal. 

65. The method of Claim 64, wherein the parasite is a 
nematode. 

66. A method for incapacitating or killing a pest, 
comprising increasing the activity of a cell death 

10 gene in the pest. 

67. A method of biological containment of a 
recombinant organism, comprising introducing in 
the organism nucleic acid which is able to direct 
the expression of an agent which increases the 

15 activity of a cell death gene in the organism- 

under predetermined conditions, thereby 
incapacitating or killing the recombinant 
organism. 

68. The method of Claim 67, wherein the agent kills 
2 0 the recombinant organism upon completion of a 

desired task by the organism. 
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CLONING, SEQUENCING AND CHARACTERIZATION 
OF TWO CELL DEATH GENES AND USES THEREFOR 
Abstract of the Disclosure 

Described herein are genes shown to be essential 
5 for programmed cell death in C. elegans, their encoded 
products (RNA and polypeptides) , antibodies directed 
against the encoded polypeptides; probes for 
identifying structurally related genes and bioassays 
for identifying functionally related cell death genes 

10 from various organisms; methods and agents for altering 
(increasing or decreasing) the activity of the cell 
death-genes and, thus, of altering cell death; and uses 
therefor. Specifically, two genes shown to be 
essential for almost all of the cell deaths which occur 

15 in the development of C, elegans , referred to as ced-3 
and ced-4, have been cloned, sequenced and 
characterized. 



CAATTCCCGTCSAATCATTCrrCTGTTCGCTAICSAXTCAGAAACCCAAACTTCTGATCGA 

TAACAAC:rCATTCAAACACCC;CGAAGATC3TCTAIGCGTAXAACAGAATArrTGGAGAAAT 

CCTCGCAAAACTCGAAATTCTCACCGATAAAATGATXAACTTGAACGCGCrAATGTAAGT 
121 + > + + — : + ^ 3^3Q 

TAICTGATGTTTCTACAAXTAAAAAAATTGTTTTTTTTTCCAAATTAATTTTCGAACATT 
131 + H + ^ ^ ^ 240 

AACGAAAAACG AXXAAAAAXCAAXAAAACGCAAX AAAGAGGGCXXCC C X 1 1 L 1 1 T T TA AX 
241 + H + + ^ ^ 

AAXAXXCXAXXACACCGXXCCGAXXCXCCGCAAAXAXCACCGACACXCCAACAXXXACAA 
3 SI + + + + ^ ^ ^ ^20 

CAACGACGXGTCACAAXCACXAAGXCAAACACGGAAAGGAXAAACGAXXCXGAXArrXCA 

CTCXXXXACXCAXXCGCXXTrXAAAXAACAACXAXAXGCCGAXXXGCCCAXAXAXXXTTG 
431 ^ ^ ^ ^ ^ ^ 

XXXAXXACGCCXCXCACAXXCCXGXACAAXGTrXCXACCAAAXAAACXGCAXXXXXAXCX 

541 + + + + + ^ gQQ 

CAAAAXXCGAAXXXATXXXXGXCXACXXXXXACXCGXXGCAXXCCACAXCAGCAXAXCXX 



601 



6 SO 



CCGGXCXAXXXAXAXXCAACGAXXXXXAXAAAXXAGXACXCCXXCAXGXXXAAXXXCAXX 
SSI * * ^ + ^ ^ 720 

XXAXCXGXAACCXXXACXGXAXXXXXXXAAAAXCXXXCXXGCXXCXAXCXGAXXAXACAA 
72X + ^ ^ ^ ^ 

TGXXCXXXACTCAXXXXCAACGXAXXXXXAXGCCXCACAATXXAXGCACAXXXCGGGCXX 

CGAGAXXXATCCXCXAXAXXACAXGCCXGXXrrXXXAAAGGAXAXAAXGXXXAACAAAXA 
841 + : + + + + 9QQ 



AXTTXXXAXCAATCCXAXXGXAXAXTCXCCAGCXAACCGXXGXXXCGAAAACAXCACCXA 
™+ + ^ ^ ^ ^ 

CCATTTXAAAAXXCACAAAAXCXXGCXXCCXXAXAAXCAAGAAGAXXXXXCAGAXGCTCX 



9S1 ^ + + + + ^ 1Q20 

M L C 

dlGAAAXCGAAXGCCCCGCXXXGAGCACGGCACACACGAGGCXCAXCCACGACrrXGAAC 

1021 + ^ + ^ ^ ^ IQQQ 

EISCRALSXAHXRLIHDFEP 
10 20 

X nll€2 

CACGXGACGCAXXGACXXAXXXAGAAGGCAAAAACAXXXXCACACAACAXCAXXCXGAAC 
1031 + ^ ^ ^ ^ ^ ^^^^ 

RDALXYLSGXHIFXEDHSSL 
30 40 

XXATCAGXAAAATCXCAACTCGCCXCGACAGGAtCGCCAAXXXXCXXCCAAXCXAXCGAC 
1141 + — + ^ + + ^ 1200 

iSXMSXRLSRIANFLRirRR 

SO so 

GTCAAGCXXCXGAACXXGGACCACTCAXCGACXXXXXCAACXACAACAAXCAAAGXCACC 

QASSLGPLIDFrN^HNQSHL 
70 30 
XTGCXGAXXXCCXCGAAGACXACAXCGAXTXTGCGAXAAAXGACCCACAXCXACXXCGXC 

ADFLSOriOFAINEPDLLR? ^^^^ 
90 100 

CACTAGXGAXXGCXCCACAAXXXXCCCGACAAAXCCXCCAIAGGAAACXAXXCCXXGGGA 
1321 . ^ . * ^ , ^330 

VVIAPQFSRQMLDRXLLLGN 
110 120 
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T n2214 

ATCTTCCAAAACAAATGACATGCTATATTCGAGAGTATCACGTGCATlcACTCArCAAAA 

VPKQMTCriREYaVDRVIXK 
130 140 

ilntron 1 
TGAGAAAACTCCAACCTCTCCTGTtTATTATAATC 

LDEMCDLO 
TTGCTTAAACTTCAGkcTCCTTTTTTCTGTTTCTACACCGCCGACCTCGArCCCGAAAAT 

SFFLFLSGRACSCKS 

ISO 

AIntroa 2 
TGAGTGCTAT 

ISfil ^ + + + + + 1S20 

VIASQALSKSDQLIGI 
170 I ISO 
TATCTGAATCTACGGAXCTTCAXTCTATTACAGAAATTATGATTCAATCCTTTGGCTCAA 
1S21 ^ + ^ ^ + * * 1S80 

NYDSIVWLK 
130 

AGATAGTGGAACAGCTCCAAAATCTACATTCGATTTATTTACGGATATTTTGCTGATGCT 
ISai + + + + + + 1740 

DSGTAPKSTFDLFXDILLML 

200 210 

A nl920/n2247 



Inrron 3* 
AAPpiC 



rGAGTGAATAGAGTGCATGTAACATTCAGCATGAtTTTGAAATTATGAAAATTrGA 

1741 + + + + + 1800 

K 

CCTGGTTAGCTTTTAATTTGATATTTCGTGACGCTTGCATGTTTTGTGTGTTTGAAGACG 

1301 — + + + A 4. 1350 

AGCCCGTGTTGTGAGCGACACGGATGACTCGCATTCCATCACCCACTTCATTAACCGTGT 

13 SI -t- ^ + + + + 1920 

113 



A. n2272 
rCTTTCAAckAGCGA 



TCTTTCAAGAAGCGAAGACGATCTTCTCAATrrCCCATCCGTGGAGCATGTCACCTCAGT 

SEDDLLNFPSVSHVTSV 
220 

j Incron 4 

TGTACTCAAAAGGATGpTAAGTTGCrrGCCGATTCTGGTACAATATCTTAAATTATTGGT 

1931 + + -I- + ^ ^. 2040 

V L X .M 
230 I 

TTTTAG^TCTGCAACGCACTCAtTGATCGTCCAAATACTTTArrCGTATrrGATGACGTA 

ICNALIDRPNTLFVFDOV 
240 250 
A nl$4B T al$47 

\ \ 

GTrCAAGAAGAAACAATTCGTTCGGCTCAGGAGCTACGTCTiCGATGTCrrGTAACTACT 
2101 + + + + 21S0 

VQSSTIRWAQELRLRCLVTT 

2S0 270 

CGTGACGTGGAAArAXCAAArGCTGCTTCTCAAACATGCGAATTCATTGAAGTGACATCA 
2131 + + + + + + 2220 

ROVIISNAASQXCEFISVTS 

230 290 
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TTCGAAATCCATGMTGIIATGATTTTCTAGAAGCTTATCGAATCCCCATGCCTCTTGGA 
2221 + + + + ^ ^ 2280 

LEIDECrOFLEAYCMPMPVG 

300 

Tci' nX41S 



TAicCACTC 



CAAAAAGAAGAAGATCTCCTTAATAAAACAATCCAACTAXCCACTCGAAATCCACCAACC 
2291 ^ . ^ ^ ^ ^ 234 0 

EKSEOVLNXTIELSSCNPAT 

220 

tlntron 5 

23 1 

LMMrrXSCEPKTFEK * ^^^^ 

330 

AArTTGAaACTTTTAAAATAATTTArrCTACAATAAAAGTTAATCAAAAACTTTCAlACC 
2401 > . ^ ^ , ^ 24 60 

TCArrGTCTTTAAATTTTACGAATTGAGGATCAAAAICAACAATTAGGATCCTCGCACGA 

GAGAAAACTGTCTACCTACCCTACCCGAGAG ArrTTCTTGATATTTCCCATCCATTTAAT ^ ^ 

TTrTTAAGAAAATTAXCGTTTTACATAATTGAACAAGAGATACACGGTCTCGACCCCACG ^^^^ 

GAAATTTrrTAAATCAAAGCCAGTATCAGCCTGTTTTCATTATTTTTCGATTTTCTCTTG 

TTGITTCTTTTTATTTAAACCCTTTTATTTTGAAACAAGTCTAAAAAXATTAAAAACTG A ^ ^ ^ 
2701 + ^ ^ ^ ^ 2760 

ATAAAAIATTTAAAAAAAATCAAGTAAAATAGAAAAACAGCAAGGCTCGACACTACTGTA 

CTTCTTAAATCCGCAXACTCTTTTTATTTAATCATTTTCCGGAATGTCGAAACCAAATAA ^^^^ 

r ACAXTTTT AGTCCAAAATCGCT AGGTATATTCTT AAAATT ATCAAACATTTTCCATTCA ^ ^ ^ ^ 
2331 -| * ^ + + ^ ^ 2 940 

C^XGGCACAGCXXAAXAACAAAXTCGAAAGXCGAGGATXAGXCGGXGXXGAAXCXAXCA 
2341 + + + + + ^ 3000 

MAQLNNXLESRGLVGVECIX 
340 350 

CCCCXXACTCGXACAACXCACTCGCAAXGGCXCTXCAAAGATGXGTXGAAGTXXXGXCAG 

PYSrKSLAMALQRCVEVLSD 
360 370 

AXGAGGATCGAAGXGCXCXTGCXXTCGCAGTTCXGAXCCCTCCTCGAGXXGATAXACCCG 
30 51 -h + + ^ ^ 3^20 

EDRS.ALArAVVMPPGVDiPV 
330 390 
A nl394 

t 

TCAAGCTAXGGXCATGTGXTAXTCCAGTXGAXAXTTCTXCAAATGAAGAAGAACAATTGG 

XL^WSCVIPVDICSNSESQLD ^^^^ 
400 410 

i Inrron 6 

ATGAXGAAGXTGCGGATCCCXXGAAAACACTCACCA^trrATGAGTCTTGAAAXTTGAAGA 

3131 + ^ * > + ^ 324 0 

DEVADRLXRLSX 
420 I 

TTTAAAXXAACACXXAAAAXTTCAGkcGTGGAGCTCTTCXCAGTGGAAAACGAATGCCCG 

RGALLSCXRM?V 
430 440 
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TTTTCACATTCAAAAITGATCATATTATCCATATCTTCTTCAAACACCTCGTTGATCCAC 

LTFXIDailHKFLKttVVDAQ 

iSO 460 

i Inrron 7 I 
TATGCTSAAAAIGTCTCAACTTTCAATTAAATtTTAAATTTTCAGkAT 

3361 + + -t- + K + 3420 

T I A N 
CCAATCTCAATTCTCCACKZACCCTCTTCTTGAAATAGCAAACAATAATCTATCACT^ 

3421 ^ + +- + 3430 

CISILSQRLLEIGNNKVSV? 

470 480 
CACCCACATATACCATCACATTTCCAAAAATTCCGTCCT7CATCAGCCACTGAGATGTAT 

34 31 + + + + 354 0 

SRHIPSEFQXFRRSSASEMY 

500 510 
CCAAAAACTACACAACAAACTCTGATCCGTCCrGAACACTTCCCAAAGTTCATCCAArTG 

3541 + ^ ^ + + + 3 600 

?XTTS£TVIRP2DF?KFMQL 

520 530 
CACCAGAAATTCTATGACTCCCTCAAAAATTTTCCATGCTGTIAAAACCTATCGTGTACA 

HQXF.'JfDSLXNFACC* 
540 

ATATTGCCTGTATATtCCCCTCGAAATACGTTTATACTTTTTCGCACGAGTTTTCTCAXT 
3661 + + + + + + 3720 

TTTTCATTTGTACTTGTTTTAXTTCTCTCCAAAATTTCAGATCTATCCCAAATGTTCTTA 

3721 + + + + + + 3780 

AATTTAATGTTTTCTACAGATACTCAACACAtCTTGTrTCATCTCATCCTTGCTTTTTTT 
3731 + + + + + + 3840 

rrrCAAATATATTCAGTTTCTTTTATAATTTTAATTAATCGAAXTAATACATTCACGTAA 

AGAAXTTCGTGCACIATTArrTTAXCGCATCCAAATGATTTAXTCCCTArrGTTCGAAAC 
3901 + + + + + + 3960 

XXCCAAAXXGAXCAXXXTTAAACACGCCXCATTAAAXTGAAAGXCGXACrrXTAGXCXCG 
3961 + + + + + + 4 020 

AACAXGAAGTAAGTTArrXXCXGXGTXCTAAAXTCAAAGXGCAXXCCAAAACGACAXXXG 

AXGAGTTXTCACGAAAACCGTAAXTTTXACAAXTXCCTTXCACXXXTGAAGAXGTXCGAX 
4 0 31 + + -f + + + 4140 

XXCXXXCCXCTGTXGCCCXCAXXACXACAXXXCCXTXGCXCCXTCACTTtAXCGAGAXXC 
4141 + + + + + + 4200 

TXGCCAXCAAXCGAGXXCCAXCXAGACCGATAGCAGXCTTCAXAXCATXATCCCXGXAXA 
4201 + + ■¥ + + 4260 

XXGXACXGtXTCAGXAXXXTAACXTAXCGAXXACGXACXAXAXXCAGXGGXTCACXGTXX 
4261 + + + + + + 4320 

XCGGXCAAXCGGXGACACCXCCXCCACGAKNAAX.XTXCAACCAACGCAAXCXCCXACXCA 
4321 + — : + -t. ■ + ^ + 4 330 

CrrAXCAACCAACAGCCCXCACCCATG 
4331 ^ ^ 4407 
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ced'3 Genomic Sequence 



XGATCTGAAATAAGGTGATAAATTAATAAATTAAGTGTATTTCTGAGGAAATTTGACTGT 
I + + ^ + + + €0 

TTTAGCACAATTAATCTTGTTTCAGAAAAAAAGTCCAGTTTTCTAGATTTTTCCGTCTTA 
61 + + + + (. + 120 

TTGTCGAATTAATATCCCTATTATCACTTTTTCATGCTCATCCTCGAGCGGCACGTCCTC 
121 + + + + (- + 180 

AAAGAATTGTGAGAGCAAACGCGCTCCCATTGACCTCCACACTCAGCCGCCAAAACAAAC 
181 + + + + + + 2 40 

GTTCGAACATTCGTGTGTTGTGCTCCTTTTCCGTTATCTTGCAGTCATCTTTTGTCGTTT 
241 + + + + + + 300 

TTTTCTTTGTTCTTTTTGTTGAACGTGTTGCTAAGCAATTATTACAXCAATTGAAGAAAA 
301 + + + + + + 360 

GGCTCGCCGATTTATTGTTGCCAGAAAGATTCTGAGATTCTCGAAGTCGATTTTATAATA 
361 + + + + + + 420 

TTTAACCTTGGTTTTTGCATTGTTTCGTTTAAAAAAACCACTGTTTATGTGAAAAACGAT 
421 + + + + + + 480 

TAGTTTACTAATAAAACTACTTTTAAACCTTTACCTTTACCTCACCGCTCCGTGTTCATG 
431 + + + + J. + 540 

GCTCATAGATTTTCGATACTCAAATCCAAAAATAAATTTACGAGGGCAATTAATGTGAAA 
541 + + + + + + 600 

CAAAAACAATCCTAAGATTTCCACATGTTTGACCTCTCCGGCACCTTCTTCCTTAGCCCC 
601 + + + + + + 660 

ACCACTCCATCACCTCTTTGGCGGTGTTCTTCGAAACCCACTTAGGAAAGCAGTGTGXAT 
6€1 + + + + + + 720 

CTCATTTGGTATGCTCTTTTCGAXTTTATAGCTCTTTGTCGCAATTTCAATGCTTTAAAC 
721 + + + + + + 7 60 

AATCCAAATCGCATTATATTTGTGCATGGAGGCAAATGACGGGGTTGGAATCTTAGATGA 
781 + ^ + + + ^ ^ 840 

GATCAGGAGCTTTCAGGGTAAACGCCCGGTTCATTTTGTACCACATTTCATCATTTTCCT 
341 -j.- + + + + + 900 

GTCGTCCTTGGTATCCTCAACTTGTCCCGGTTTTGTTTTCGGTACACTCTTCCGTGATGC 
901 + + + + + + 960 

CACCTGTCTCCGTCTCAATTATCGTTTAGAAATGTGAACTGTCCAGATGGGTGACTCATA 
961 + + + + + + 1020 

TTGCTGCTGCTACAATCCACTTTCTTTTCTCATCGGCAGTCTTACGAGCCCATCATAAAC 
1021 + + + + + + 1080 

TTTTTTTTCCGCGAAATTTGCAATAAACCGGCCAAAAACTTTCTCCAAATTGTTACGCAA 
1081 + + + + + + 1140 

TATATACAATCCATAAGAATATCTTCTCAATGTTTATGATTTCTTCGCAGCACTTTCTCT 
1141 +-y 4. 4 + + + 1200 

TCGTGTGCTAACATCTTATTTTTATAATATTTCCGCTAAAATTCCGATTTTTGAGTATTA 
1201 + + + + + + 1260 

ATTTATCGTAAAATTATCATAATAGCACCGAAAACTACTAAAAATGGTAAAAGCTCCTTT 
1261 + + + + + + 1320 

Repeat 1 



TAAATCGGCTCGACATTATCGTATTAAGGAATCACAAAATTCTGAGAATGCGTACTGCGC 
1321 + + + -H -f 1380 



AACATATTTGACGGCAAAATATCTCGTAGCGAAAACTACAGTAATTCTTTAAATGACTAC 
1381 + + + + + + 1440 

Repeat 1 

— — — — — ^> <- — ^„ 

TGTAGCGCTTGTGTCGATTTACGGGCTCAATTTTTGAAAATAATTTTTTTTTTCGAATTT 
1441 + + + + + + 1500 
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TGATAACCCGTAAATCGTCACAACGCTACAGTAGTCATTTAAAGGATTACTGTAGTTCTA 
1501 + + + + + + 1560 



GCTACGAGAXATTTTGCGCGCCAAATAXGACTGTAATACGCAXTCTCTGAATTTTGTGTT 
1561 + + + + + + 1620 

TCCGTAATAATTTCACAAGATTTTGGCATTCCACTTTAAAGGCGCACXGGATTTATTCCA 
1621 + + + + + + 1630 

ATGGGTCTCGGCACGCAAAAAGTTTGATAGACTTTTAAATTCTCCTTGCATTTTTAATTC 
1681 + + + + + + 17 4 0 

AATTACTAAAATTTTCGTGAATTTTTCTGTTAAAATTTTTAAAATCAGTTTTCTAATATT 
17 41 + + + + + + 1800 

TTCCAGGCTGACAAACAGAAACAAAAACACAACAAACATTTTAAAAATCAGTTTTCAAAT 
1801 + + + + + + 1860 

TAAAAATAACGATTTCTCATTGAAAATTGTGTTTTATGTTTGCGAAAAXAAAAGAGAACT 
1861 + + + + + + 1920 

GATTCAAAACAAXXXXAACAAAAAAAAACCCCAAAAXTCGCCAGAAAXCAAGAXAAAAAA 
1921 + + + + + + 1980 

XXCAAGAGGGXCAAAAXXXTCCGAXXTXACTGACTXTCACCXXXXTXXXCGTAGTTCAGX 
1981 + + + + + + 2040 

GCA'GXXGTXGGAGTTXXXGACGAAAACTAGGAAAAAAAXCGAXAAAAAXTACTCAAATCG 
2041 + + + + + + 2100 

AGCTGAAXXTXGAGGACAAXGXXTAAAAAAAAACACXAXXXTXCCAATAAXTXCACXCAX 
2101 + + + + + + 2160 



XXXCAGACXAAAXCGAAAAXCAAAXCGXACTCTGACXACGGGXCAGXAGAGAGGXCAACC 

2161 + + + + + + 2220 

AXCAGCCGAAGAXGATGCGXCAAGAXAGAAGGAGCXXGCXAGAGAGGAACAXXAXGATGT 

2221 + + + + + + 223 0 

MMRQDRRSLLERNIMMF 
1 10 
T(nl040) 
{ 

XCXCXAGXCATCXAAAAGXCGAXGAAAXXCXCGAAGXXCXCAXCGCAAAACAAGXGXXGA 

2281 + + + + + + 2340 

SSHLKVDZILEVLIAKQVLN 
20 , 30 
^ lintron 1 

AXAGXGAXAAXGGAGAXAXGAXXAAXGXGAGXXTTXAAXCGAAXAAXAATXXXAAAAAAA 

2341 + + + + + + 2400 

SDNGDMIN 
40 

I 

AAXXGAXAAXAXAAAGA^XAXTXXTGCAGXCAXGXGGAACGGXXCGCGAGAAGAGACGGG 

2401 + + + + + + 2460" 

SCGXVREKRRE 
50 

A(n718) 
I 

AGAXCGXGAAAGCAGTGCAACGACGGGGAGAXGXGGCGXXCGACGCGXXXXAXGAXGCXC 
24 61 + + + + + + 2520 

IVKAVQRRGDVAFDAFYDAL 
60 70 

lintron 2 

TXCGCTCXACGGGACACGAAGGACXTGCXGAAGXXCXXGAACCXCXCGCCAGAXCGXAGG 

2521 + + + + + + 2530 

RSXGHEGLAEVLEPLARS 
80 90 
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TTTTTAAAGTTCGGCGCAAAAGCAAGGGTCTCACGGAAAAAAGAGGCGGATCGTAATTTT 
25 81 + + + + h + 2 64 0 

GCAACCCACCGGCACGGTTTTTTCCTCCGAAAATCGGAAATTATGCACTTTCCCAAATAT 
2641 + + + + + + 2700 

TTGAAGTGAAATATATTTTATTTACTGAAAGCTCGAGTGATTATTTATTTTTTAACACTA 
27 01 + + + + + + 27 60 

ATTTTCGTGGCGCAAAAGGCCATTTTGTAGATTTGCCGAAAATACTTGTCACACACACAC 
27 61 + + + + + + 2820 

i 

ACACACATCTCCTTCAAATATCCCTTTTTCCAGTGTTGACTCGAATGCTGTCGAATTCGA 

2821 + + + + + + 2880 

VDSNAVEFE 
100 

GTGTCCAATGTCACCGGCAAGCCATCGTCGGAGCCGCGCATTGAGCCCCGCCGGCTACAC 

2881 + + + + + + 2940 

CPMSPASHRRSRALSPAGYT 
110 120 

TTCACCGACCCGAGTTCACCGTGACAGCGTCTCTTCAGTGTCATCATTCACTTCTTATCA 

2941 , + + + + + + 3000 

s'p TRVHRDSVSSVSSFTSYQ 
130 140 

GGATATCTACTCAAGAGCAAGATCTCGTTCTCGATCGCGTGCACTTCATTCATCGGATCG 

3001 + + + + + + 3060 

DIYSRARSRSRSRALHSSDR 
150 160 

i intron 3 "J 
ACACAATTATTCATCTCCTCCAGTCAACGCATTTCCCAGCCAACCTTGTATGTTGATGCG 

3061 +-- + + + + 3120 

HNYSSPPVNAFPSQPS 
170 

Repeat 1 



AACACTAAATTCTGAGAATGCGCATTACTCAACATATTTGACGCGCAAATATCTCGTAGC 
3121 + + + + + + 3180 



GAAAAATACAGTAACCCTTTAAATGACTATTGTAGTGTCGATTTACGGGCTCGATTTTCG 
3181 + -i- + + + + 3240 

AAACGAATATATGCTCGAATTGTGACAACGAATTTTAATTTGTCATTTTTGTGTTTTCTT " ' 
3241 + + + + + + 3300 

Repeat 1 

TTGATATTTTTGATCAATTAATAAATTATTTCCGTAAACAGACACCAGCGCTACAGTACT 
3301 + + + + + + 3360 



CTTTTAAAGAGTTACAGTAGTTTTCGCTTCAAGATATTTTGAAAAGAATTTTAAACAT7T 
3361 + + + + + + 3420 

TGAAAAAAAATCATCTAACATGTGCCAAAACGCTTTTTTCAAGTTTCGCAGATTTTTTGA 
3421 + + + + + ^ 24Q0 
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Repeat 2 

TTTTTTTCATTCAAGATATGCTTATTAACACATATAATTATCATTAATGTGAATTTCTTG 
3481 + + + + + + 3540 



TAGAAATTTTGGGCTTTTCGTTCTAGTATGCTCTACTTTTGAAATTGCTCAACGAAAAAA 
3541 + + + + + + 3 600 



TCATGTGGTTTGTTCATATGAATGACGAAAAATAGCAATTTTTTATATATTTTCCCCTAT 
3601 + + h + + + 3660 



TCATGTTGTGCAGAAAAATAGTAAAAAAGCGCATGCATTTTTCGACATTTTTTACATCGA 
3661 + + + + + + 3720 



ACGACAGCTCACTTCACATGCTGAAGACGAGAGACGCGGAGAAATACCACACATCTTTCT 
3721 + + + + + + 3730 

Repeat 2 

<-.««.«».„..««..„„„..„«»„„„..„^.„„..„„„^^^^„ 

GCGTCTCTCGTCTTCAGCATGTGAAATGGGATCTCGGTCGATGTAAAAAAATGTCGAATA 
3781 + + + + + 3840 



ATGTAAAAAATGCATGCGTTTTTTTACACTTTTCTGCACAAATGAATAGGGGGAAAATGT 
3841 + + + + + + 3 900 



3901 



ATTAAAATACATTTTTTGTATTTTTCAACATCACATGATTAACCCCATTATTTTTTCGTT 



— + 3960 



GAGCAACTTAAAAAGTAGAGAATATTAGAGCGAAAACCAAAATTTCTTCAAGATATTACC 
3 9 61 + + + + + + 4 020 



TTTATTGATAATTATAGATGTTAATAAGCATATCTTGAATGAAAGTCAGCAAAAATATGT 
4021 + + + + + ^ 

GCGAAACACCTGAAAAAAATCAAAAATTCTGCGAAAATTGAAAAAATGCATTAAAATACA 
4081 + + + + ^ ^ 

TTTTTGCATTTTTCTACATCACATGAATGTAGAAAATTAAAAGGGAAATCAAAATTTCTA 
4141 + + + + + ^ 4200 

GAGGATATAATTGAATGAAACATTGCGAAATTAAAATGTGCGAAACGTCAAAAAAGAGGA 
4201 + + + + + ^ 42gQ 

i 

AATTTGGGTATCAAAATCGATCCTAAAACCAACACATTTCAGCATCCGCCAACTCTTCAT 
4261 + + + + ^ ^ 4320 

S A N S 3 F 
180 
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TCACCGGAXGCTCTTCTCTCGGATACAGTTCAAGTCGTAATCGCTCATTCAGCAAAGCTT 



4321 ^ + + + + + 4330 

TGCSSLGYSSSRNRSFSKAS 
190 200 

CTGGACCAACTCAATACATATTCCATGAAGAGGATATGAACTTTGTCGATGCACCAACCA 

4331 + + + + + + 4 440 

GPTQYIFHEEDMNFVDAPTI 
210 220. 

TAAGCCGTGTTTTCGACGAGAAAACCATGTACAGAAACTTCTCGAGTCCTCGTGGAATGT 

44 41 + + + + + + 4500 

SRVFDEKTMYRNFSSPRGMC 
230 240 

GCCTCATCATAAATAATGAACACTTXGAGCAGATGCCAACACGGAATGGTACCAAGGCCG 

4501 + + + + + + 4560 

LI INNEHFEQMPTRNGTKAD 
250 260 

ACAAGGACAATCTTACCAATTTGTTCAGATGCATGGGCTATACGGTTATTTGCAAGGACA 

4561 — + + + + + + 4620 

KDNLTNLFRCMGYTVICKDN 
270 280 

I intron 4 

ATCTGACGGGAAGGGTACGGCGAAATTATATTACCCAAACGCGAAAXTTGCCATTTTGCG 

4 621 + + + + h + 4 680 

L T G K 

Repeat 3 

-—--——--——-—-«-— —-———> 

CCGAAAATGTGGCGCCCGGTCTCGACACGACAATTTGTGTTAAATGCAAAAATGTATAAT 
4631 + + + + + + 4740 

TTtGCAAAAAACAAAATTTTGAACTTCCGCGAAAATGATTTACCTAGTTTCGAAATTTTC 
47 41 + + + + + + 4 800 

GTTTTTTCCGGCTACATTATGTGTTTTTTCTTAGTTTTTCTATAATATTTGATGTAAAAA 
4 301 + + + + + + 4860 

ACCGTTTGTAAATTTTCAGACAATTTTCCGCATACAAAACTTGATAGCACGAAATCAATX 
4 8 61 + + + + + + 4920 

XXCXGAAXXXXCAAAAXXAXCCAAAAATGCACAAXXXAAAAXXXGXGAAAAXTGGCAAAC 
4921 +— ' + + + + + 4 980 

GGXGXXXCAAXAXGAAAXGXAXXXXXAAAAACXXXAAAAACCACXCCGGAAAAGCAAXAA 
4981 + + + + + + 504 0 

AAAXCAAAACAACGXCACAAXXCAAAXXCAAAAGXXAXXCAXCCGAXXXGXXXAXXTXTG 
50 41 + + + + + + 5100 

CAAAAXXXGAAAAAAXCAXGAAGGAXXXAGAAAAGXXXTAXAACAXXTXTTCXAGAXTTX 
5101 + + + + + + 5160_ 

TCAAAAXXXXXTXXAACAAAXCGAGAAAAAGAGAAXGAAAAAXCGAXXXXAAAAAXAXCC 
5161 + + + + + + 5220 

Repeat 3 

ACAGCXXCGAGAGXXXGAAAXTACAGXACTCCXXAAAGGCGCACACCCCATXTGCATXGG 
5221 + + + + + + 5280 



ACCAAAAAXXXGXCGXGXCGAGACCAGGXACCGXAGXXXXXGXCGCAAAAAXTGCACCA7 
5281 + + + + + + 5340 

XGGACAAXAAACCXXCCXAAXCACCAAAAAGXAAAAXXGAAAXCXXCGAAAAGCCAAAAA 
5341 + + + + + + 5400 
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ATTCAAAAAAAAAGTCGAATTTCGATTTTTTTTTTGGTTTTTTGGTCCCAAAAACCAAAA 
54 01 + + + + + + 54 60 

AAATCAATTTTCTGCAAAATACCAAAAAGAAACCCGAAAAAATTTCCCAGCCTTGTTCCT 

54 61 + + + h + + 5520 

1 

AATGTAAACTGATATTTAATTTCCAGGGAATGCTCCTGACAATTCGAGACTTTGCCAAAC 
5521 + + + + ^ ^ 33gQ ^ 

GMLLTIRDFAKH 
290 300 

ACGAATCACACGGAGATTCTGCGATACTCGTGATTCTATCACACGGAGAAGAGAATGTGA " 
5381 + + + + ^ + 5 540 

ESHGDSAILVILSHGEEKVI 

310 320 

TTATTGGAGTTGATGATATACCGATTAGTACACACGAGATATATGATCTTCTCAACGCGG 
5 641 + + + + + + 5700 

IGVDDIPISTHEIYDLLNAA 

330 340 

A(n2433) 

i i intron 5 

CAAATGCTCCCCGTCTGGCGAATAAGCCGAAAATCGTTTTTGTGCAGGCTTGTCGAGGCG 
5701 + ^ ^ ^ ^ ^ 

NAP. RLANKPKIVFVQACRGE 

350 360 



GTTCGTTTTTTATTTTAATTTTAATATAAATATTTTAAATAAATTCATTTTCAGAACGTC 
+ + + + + + 

R R 



GTGACAATGGATTCCCAGTCTTGGATTCTGTCGACGGAGTTCCTGCATTTCTTCGTCGTG 
5821 + + + ^ + + 5Qg0 

DNGFPVLDSVDGVPAFLRRG 
370 380 

T(nll65} 

GATGGGACAATCGAGACGGGCCATTGTTCAATTTTCTTGGATGTGTGCGGCCGCAAGTTC 
5381 +— + + + + + 554Q 

WDNR'DGPLFNFLGCVRPQVQ 
390 400 

[ intron 6 

AGGTTGCAATTTAATTTCTTGAATGAGAATATTCCTTCAAAAAATCTAAAATAGATTTTT 
5941 ^ ^ ^ ^ ^ ^ g^QQ 

ATTCCAGAAAGTCCCGATCGAAAAATTGCGATATAATTACGAAATTTGTGAXAAAATGAC 

6001 4 + + + + ^ gQgQ 

Repeat 4 

AAACCAATCAGCATCGTCGATCTCCGCCCACTTCATCGGATTGGTTTGAAAGTGGGCGGA 
6061 + + ^ + + ^ g^20 

GTGAATTGCTGATTGGTCGCAGTTT7CAGTTTAGAGGGAATTTAAAAATCGCCTTTTCGA 
6121 + + + + ^ ^ g^gQ 

AAATTAAAAATTGATTTTTTCAATTTTTTCGAAAAATATTCCGATTATTTTATATTCTTT 
6131 + ^ ^ ^ ^ ^ ,240 
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ACn717) 
I 

GGAGCGAAAGCCCCGTCCTGTAAACATTTTTAAATGATAATTAATAAATTTTTGCAGCAA 
6241 + H- + + + + €300 

Q 

T{nl949) 
I 

GTGTGGAGAAAGAAGCCGAGCCAAGCTGACATTCTGATTCGATACGCAACGACAGCTCAA 
€301 + + + + + + 6360 

VWRKKPSQADILIRYATTAQ 
410 420 

A(nl286) 
I 

TATGTTTCGTGGAGAAACAGTGCTCGTGGATCATGGTTCATTCAAGCCGTCTGTGAAGTG 
63 61 + + + + + ^ g420 

YVSWRNSARGSWFI QAVCEV 
430 440 

T (nll29,nll64) 
1 

TTCTCGACACACGCAAAGGATATGGATGTTGTTGAGCTGCTGACTGAAGTCAATAAGAAG 
6421 + + + + ^ 6480 

FST-KAKDMDVVELLTEVNKK 
450 460 

^(^2430) A(n242S) 
1 I i intron 7 
GTCGCTTGTGGATTTCAGACATCACAGGGATCGAATATTTTGAAACAGATGCCAGAGGTA 
6481 + + + + + + ^6540 

VACG^FQTSQGSNILKQMPE 
470 480 

Repeat 5 



CTTGAAACAAACAATGCATCTCTAACTrTTAAGGACACAGAAAAATAGGCAGAGGCTCCT 
6541 + + + + + + €600 



TTTGCAAGCCTGCCGCGCGTCAACCTAGAATTTTAGTTTTTAGCTAAAATGATTGATTTT 
6601 + + + 4. + 6660 

GAATATTTTATGCTAATTTTT7TGCGTTAAATTTTGAAATAGTCACTATTTATCGGGTTT 
6661 + + + + + + 6720 

CCAGTAAAAAATGTTTATTAGCCATTGGATTTTACTGAAAACGAAAATTTGTAGTTTTTC 
6721 + + + + + + 6780 

AACGAAATTTATCGATTTTTAAATGTAAAAAAAAATAGCGAAAATTACATCAACCAtCAA 
67 81 + + + + + + 684^ 

GCATTTAAGCCAAAATTGTTAACTCAT7TAAAAATTAATTCAAAGTTGTCCACGAGTATT 
6841 + + + + + + 6900 

Repeat 5 

ACACGGTTGGCGCGCGGCAAGTTTGCAAAACGACGCTCCGCCTCTTTTTCTGTGCGGCTT 

6901 + + + + ^ ^ gg^Q 



T(nll63) 

"~" i I 

GAAAACAAGGGATCGGTTTAGATTTTTCCCCAAAATTTAAATTAAATTTCAGAtGACATC 
6961 + + + + + ^ ^Q20 

M T S 
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1 



CCGCCTGCTCAAAAAGTTCTACTTTTGGCCGGAAGCACGAAACTCTGCCGTCTAAAATTC 

7021 + + »• + + + 7080 

RLLKKFYFWPEARKSAV* 
490 500 

ACTCGTGATTCATTGCCCAATTGATAATTGTCTGTATCTTCTCCCCCAGTTCTCTTTCGC 
7081 + (■ + + + + 714 0 

CCAATTAGTTTAAAACCATGTGTATATTGTTATCC7ATACTCATTTCACTTTATCATTCT 
7141 + + + 1- (• + 7200 

ATCATTTCTCTTCCCATTTTCACACATTTCCATTTCTCTACGATAATCTAAAATTATGAC 
7201 + + + + + 7260 

GTTTGTGTCTCGAACGCATAATAATTTTAAXAACTCGTTTTGAATTTGATTAGTTGTTGT 
7261 + h + + + + 7320 

GCCCAGTATATATGTATGTACTATGCTTCTATCAACAAAATAGTTTCATAGATCATCACC 
7321 + + + + + + 7380 

CCAACCCCACCAACCTACCGTACCATATTCATTTTTGCCGGGAATCAATTTCGATTAATT 
7381 + + + + + + 7 4 40 

TTAACCTATTTTTTCGCCACAAAAAATCTAATATTTGAATTAACGAATAGCATTCCCATC 
7441 + + + + + + 7500 

TCTCCCGTGCCGGAATGCCTCCCGGCCTTTTAAAGTTCGGAACATTTGGCAATTATGTAT 
7501 + + + + + + 7560 

AAATTTGTAGGTCCCCCCCATCATTTCCCGCCCATCATCTCAAATTGCATTCTTTTTTCG 
7561 + + + + h + 7620 

CCGTGATATCCCGATTCTGGTCAGCAAAGATCT 
7621 + + 7653 
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Line 2 C. Jbriggsae 
Line 3 C- vulgaris 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Declaration for Patent Application 

As a named inventor, I hereby declare that: 

My residence, post office address and citizenship are as 
stated next to my name; 

I believe I am the original, first and sole inventor (if 
only one name is listed) or an' original, first and joint inventor 
(if plural names are listed in the signatory page(s) commencing 
at page 3 hereof) of the subject matter which is claimed and for 
which a patent is sought on the invention entitled 

CLONING. SEQUENCING AND CHARACTERIZATION OF TWO CELL DEATH GENES 



AND USES THEREFOR 



the specification of which (check one) 
/ / is attached hereto. 

I'Yj was filed on November 20. 199 2 as 

Application Serial No. 07/979.638 . 

and was amended on (if applicable) . 

I hereby state that I have reviewed and understand the 
contents of the above-identified specification, including the 
claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose information which is 
known by me to be material to patentability as defined in 37 
C.F.R. Sl-56- 

I hereby claim foreign priority benefits under Title 35 ^ 
United States Code, §119 of any foreign application (s) for patent 
or inventor's certificate listed below and have also identified 
below any foreign application for patent or inventor's 
certificate having a filing date before that of the application 
on which priority is claimed: 



Prior Foreign Application fs) 



(Number) 


(Country) 


(Day/Month/Year 


filed) 


(Number) 


(Country) 


(Day/Month/Year 


filed) 


(Number) 


(Country) 


(Day/Month/Year 


filed) 



Priority 
Claimed 

Yes No 

/Z7 IZJ 
Yes No 

l~ lU 
Yes No 
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I hereby claim the benefit under Title 35, United States Code, 
S120 of any United States application (s) listed below and, 
insofar as the subject matter of each of the claims of this , 
application is not disclosed in the prior United States 
application in the manner provided by the first paragraph of 
Title 35, United States Code, 5112, I acknowledge the duty to,^ 
disclose information known by me to be material to patentability 
as defined in 37 C.F.R. Sl«56 which occurred between the filing 
date of the prior application and the national or PCT 
international filing date of this application: 



07/897,788 



(Application Serial No*) 



June 12, 1992 



pending 



(Filing date) (Status, patented, 
pending , abandoned ) 



(Application Serial No-) 



(Filing date) (Status, patented, 
pending , abandoned) 



As a named inventor, I hereby appoint the following 
attorney (s) and/or agent (s) to prosecute this application and 
transact all business in the Patent and Trademark Office 
connected therewith. 



I also hereby grant additional Powers of Attorney to the 
following attorney (s) and/or agent (s) to file and prosecute an 
international application under the Patent Cooperation Treaty 
based upon the above-identified application, including a' power to 
meet all designated office requirements for designated states. 



David E. Brook 
James M. Smith 
Leo R. Reynolds 
Richard A. -Wise 
Patricia Granahan 
Mary Lou Wakimura 
Thomas 0. Hoover 
Alice O. Carroll 
Pei-Pei C, Smith 
all of Hamilton, Brook, 
Drive, Lexington, Massa 



Registration No. 22,592 
Registration No, 28,043 
Registration No. 20,884 
Registration No. 18,041 
Registration No. 32,227 
Registration No. 31,8 04 
Registration No. 32,470 
Registration No. 33,542 
Registration No. 36,280 
Smith and Reynolds, P.C, Two Militia 
chusetts 02173; 



and 



Send correspondence to: 



Patricia Granahan, Esq. 

HAMILTON. BROOK. SMITH & REYNO LDS, P-C. 
Two Militia Drive 

Lexington, Massachuset ts 0 2 1 7_3 - 4 7 9 9 



Patricia Granahan, Esq. 
Direct telephone calls to: (617) 861-6240 



I hereby declare that all statements made herein of my own 
knowledge are true and that all statements made on information 
and belief are believed to be true; and further that these 
statements vere made with the knowledge that willful false 
statements and the like so made are punishable by fine or 
imprisonment, or both, tinder Section 1001 of Title 18 of the 
United States Code and that such willful false statements may 
jeopardize the validity of the application or any patent issued 
thereon. 



Full name of sole 

or first inventor H. Robert Horvitz 

Signature Date Jy^^JyV^-O 2£ /7l ^ 

Residence 8 0 Wendejl Street, #2 ' 

Cambridge, Massachusetts 02138 

Citizenship U.S.A. 

Post Office Address^^ same 



Full name of second joint 

inventor, if any Junvina Yuan 

Second Inventor^s *^ v . 

Signature X — ^ " — Date ^h^[^\^ 

Residence 2 2 Fair lee Road 

_^ : Newton, Massachusetts 02168 

Citizenship china 

Post Office Address same 



Full name of third joint 

inventor, if any_^ Shai .^haham 

Third Inventor's iTT [ T|? 7 

Signature ^Jjc^^^ 

Residence 



Dat e l/2-6/f^ 



132 Commonwealth Avenue, ^9 



Boston r Massachusetts 02116 



CitizenshXD 



Israel and U.S.A. 



Post Office Address 



same 



Full name of fourth joint 

inventor, if any 

Fourth Inventor's 

Signature Date 

Residence 



Citizenship 



Post Office Address 



PATENT 
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Certificate of Mailing: Date of Deposit: August 20. 1998 

I hereby certify under 37 CFR L8(a) that this correspondence is being deposited with the United States Postal 
Service as first class mail with sufficient postage on the date indicated above and is addressed to: Assistant 
Commissioner of Patents, Washington, D.C, 2023 1 . 



Mary Jane DiPalma 



Printed name of person mailing correspondence 



Signaturey)f person mailing correspondence 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Applicant: 
Serial No.: 
Filed: 
Title: 



H. Robert Horvitz et al. 
08/984,178 
December 3, 1997 



Art Unit: 
Examiner: 



CLONING SEQUENCE^G AND CHARACTERIZATION OF TWO CELL 
DEATH GENES AND USES THEREFOR 



Assistant Commissioner of Patents 
Washington, D.C. 20231 

CHANGE OF ADDRESS 

Effective immediately, please forward all correspondence in the above appUcation to the 

undersigned attorney of record at: 

Clark & Elbing LLP 
176 Federal Street 
Boston, MA 02110-2214 
Telephone: 617-428-0200 
Facsimile: 617-428-7045 



If there are any charges, or any credits, please apply them to Deposit Account No. 03- 



2095. 




Date: 



Clark & Elbing LLP 
176 Federal Street 
Boston, MA 02110 
Telephone: 617-428-0200 
Facsimile: 617-428-7045 



